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NOVEL GENES ENCODING WHEAT STARCH SYNTHASES 
AND USES THEREFOR 

FIELD OF THE INVENTION 

5 The present invention relates generally to isolated nucleic acid molecules encoding 
wheat starch synthase enzymes and more particularly, to isolated nucleic acid 
molecules that encode wheat SSII and SSIII enzyme activities. The isolated nucleic 
acid molecules provide the means for modifying starch content and composition in 
plants, for example the ratio of amylose:amylopectin in the starch granule of the 

10 endosperm during the grain-filling phase of endosperm development. The isolated 
nucleic acid molecules of the present invention also provide the means for screening 
plant lines to determine the presence of natural and/or induced mutations in starch 
synthase genes which affect starch content and/or composition. The isolated nucleic 
acid molecules of the present invention further provide for the screening-assisted 

15 breeding of plants having desirable starch content and/or composition, in addition to 
providing for the direct genetic manipulation of plant starch content and/or composition. 

GENERAL 

Bibliographic details of the publications numerically referred to in this specification are 
20 collected at the end of the description. Reference herein to any published document 
is not to be taken as an indication or admission that any such published document is 
part of the common general knowledge or background information of a skilled worker 
in the relevant field. 

25 This specification contains nucleotide and amino acid sequence information (SEQ ID 
NOS:) prepared using the programme Patentln Version 2.0, presented herein at the 
end of the specification. Each nucleotide or amino acid sequence is identified in the 
sequence listing by the numeric indicator <210> followed by the sequence identifier 
(e.g. <210>1 , <210>2, etc). The length, type of sequence (DNA, protein (PRT), etc) 

30 and source organism for each nucleotide or amino acid sequence are indicated by 
information provided in the numeric indicator fields <211>, <212> and <213>, 
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respectively. Nucleotide and amino acid sequences (SEQ ID NOs:) referred to in the 
specification are defined by the information provided in numeric indicator field <400> 
followed by the sequence identifier (eg. SEQ ID NO: 1 is <400>1 , etc). 

5 The designation of nucleotide residues referred to herein are those recommended by 
the IUPAC-IUB Biochemical Nomenclature Commission, wherein A represents 
Adenine, C represents Cytosine, G represents Guanine, T represents thymine, Y 
represents a pyrimidine residue, R represents a purine residue, M represents Adenine 
or Cytosine, K represents Guanine or Thymine, S represents Guanine or Cytosine, W 
10 represents Adenine or Thymine, H represents a nucleotide other than Guanine, B 
represents a nucleotide other than Adenine, V represents a nucleotide other than 
Thymine, D represents a nucleotide other than Cytosine and N represents any 
nucleotide residue. 

15 The designations for naturally-occurring amino acid residues referred to herein are set 
forth in Table I. The designations for a non-limiting set of non-naturally-occurring amino 
acids is listed in Table 2. 

As used herein the term "derived from" shall be taken to indicate that a specified 
20 integer may be obtained from a particular source albeit not necessarily directly from 
that source. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
25 imply the inclusion of a stated step or element or integer or group of steps or elements 
or integers but not the exclusion of any other step or element or integer or group of 
steps or elements or integers. 
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TABLE 1 



Amino Acid Three-letter Code One-letter Code 



5 Alanine Ala A 

Afginine Arg R 

Asparagine Asn N 

Asparticacid Asp D 

Cysteine Cys C 

10 Glutamine Gin Q 

Glutamic acid Glu E 

Glycine Gly G 

Histidine His H 

Isoleucine He I 

15 Leucine Leu L 

Lysine Lys K 

Methionine Met M 

Phenylalanine Phe F 

Proline Pro P 

20 Serine Ser S 

Threonine Thr T 

Tryptophan Trp W 

Tyrosine Tyr Y 

Valine Val V 

25 Aspartate/glutamate Baa B 
Asparagine/glutamine 

Any amino acid as above Xaa X 
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TABLE 2 



Non-conventional Code Non-conventional Code 

amino acid amino acid 



5 



a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 


aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 


carboxylate 




L-N-methylaspartic acid 


Nmasp 


10 aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 


aminonorbornyl- 


Norb 


L-N-methylglutamine 


Nmgln 


carboxylate 




L-N-methylglutamic acid 


Nmglu 


cyclohexylalanine 


Chexa 


L-N-methylhistidine 


Nmhis 


cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 


15 D-alanine 


Dal 


L-N-methylleucine 


Nmleu 


D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


D-aspartic acid 


Dasp 


L-N-methylmethionine 


Nmmet 


D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 


D-glutamine 


Dgln 


L-N-methylnorvaline 


Nmnva 


20 D-glutamic acid 


Dglu 


L-N-methylornithine 


Nmorn 


D-histidine 


Dhis 


L-N-methylphenylalanine 


Nmphe 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 


D-leucine 


Dleu 


L-N-methylserine 


Nmser 


D-lysine 


Dlys 


L-N-methylthreonine 


Nmthr 


25 D-methionine 


Dmet 


L-N-methyltryptophan 


Nmtrp 


D-ornithine 


Dora 


L-N-methyltyrosine 


Nmtyr 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 


D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 


D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 


30 D-threonine 


Dthr 


L-norleucine 


Nle 


D-tryptophan 


Dtrp 


L-norvaline 


Nva 
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D-tyrosine 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 


D-valine 


Dval 


a-methyl-Y-aminobutyrate 


Mgabu 


D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 


D-a-methylarginine 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 


5 D-a-methylasparagine 


Dmasn 


a-methyl-a-napthylalanine 


Manap 


D-a-methylaspartate 


Dmasp 


cc-methylpenicillamine 


Mpen 


D-a-methylcysteine 


Dmcys 


N-(4-aminobutyl)glycine 


Nglu 


D-a-methylglutamine 


Dmgln 


N-(2-aminoethyl)glycine 


Naeg 


D-a-methylhistidine 


Dmhis 


N-(3-aminopropyl)glycine 


Norn 


10 D-a-methylisoleucine 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 


D-a-methylleucine 


Dmleu 


a-napthylalanine 


Anap 


D-cc-methyllysine 


Dmlys 


N-benzylglycine 


Nphe 


D-a-methylmethionine 


Dmmet 


N-(2-carbamylethyl)glycine 


Ngln 


D-a-methylornithine 


Dmorn 


N-(carbamylmethyl)glycine 


Nasn 


15 D-a-methylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 


D-a-methylproline 


Dmpro 


N-(carboxymethyl)glycine 


Nasp 


D-a-methylserine 


Dmser N-cyclobutylglycine 


Ncbut 


D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 


D-a-methyltryptophan 


Dmtrp 


N-cyclohexylglycine 


Nchex 


20 D-a-methyltyrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 


D-a-methylvaline 


Dmval 


N-cylcododecylglycine 


Ncdod 


D-N-methylalanine 


Dnmala 


N-cyclooctylglycine 


Ncoct 


D-N-methylarginine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 


D-N-methylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 


25 D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl) 








glycine 


Nbhm 


D-N-methylcysteine 


Dnmcys 


N-(3 ,3-diphenylpropy I) 








glycine 


Nbhe 
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D-N-methylglutamine 


Dnmgln 


N-(3-guanidinopropyl) 








glycine 


Narg 


D-N-methylglutamate 


Dnmglu 


N-( 1 -hydroxyethy Oglycine 


Nthr 


D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 


5 D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyO) 








glycine 


Nhis 


D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl) 








glycine 


Nhtrp 


D-N-methyllysine 


Dnmlys 


N-methyl-y-aminobutyrate 


Nmgabu 


10 N-methylcyclohexylalanine 


Nmchexa 


D-N-methylmethionine 


Dnmmet 


D-N-methylornithine 


Dnmorn 


N-methylcyclopentylalanine 


Nmcpen 


N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 


N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 


N-(l-methylpropyl)glycine 


Nile 


D-N-methylserine 


Dnmser 


15 N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 


D-N-methyltryptophan 


Dnmtrp 


N-( 1 -methylethy Oglycine 


Nval 


D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 


D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 


y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine Nhtyr 


20 L-/-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 


L-ethylglycine 


Etg 


penicillamine 


Pen 


L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 


L-a-methylarginine 


Marg 


L-cc-methylasparagine 


Masn 


L-a-methylaspartate 


Masp 


L-a-methyl-/-butylglycine 


Mtbug 


25 L-oc-methylcysteine 


Mcys 


L-methylethylglycine 


Metg 


L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 


L-oc-methylhistidine 


Mhis 


L-a-methylhomo 








phenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl) 




30 




glycine 


Nmet 


L-a-methylleucine 


MIeu 


L-cc-methyllysine 


Mlys 
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L-cc-methylmethionine Mmet 

L-a-methylnorvaline Mnva 

L-a-methylphenylalanine Mphe 

L-a-methylserine Mser 

5 L-a-methyltryptophan Mtrp 

L-a-methylvaline Mval 

N-(N-(2,2-diphenylethyl) 

carbamylmethyOglycine Nnbhm 
1 0 1 -carboxy- 1 -(2,2-diphenyl- 

ethylamino)cyclopropane Nmbc 



L-oc-methylnorleucine Mnle 

L-a-methylornithine Morn 

L-a-methylproline Mpro 

L-a-methylthreonine Mthr 

L-a-methyltyrosine Mtyr 
L-N-methylhomo 

phenylalanine Nmhphe 
N-(N-(3 ,3-dipheny lpropy 1) 

carbamylmethyOglycine Nnbhe 



Those skilled in the art will appreciate that the invention described herein is susceptible 
15 to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and all 
combinations or any two or more of said steps or features. 

20 

The present invention is not to be limited in scope by the specific embodiments 
described herein, which are intended for the purposes of exemplification only. 
Functionally-equivalent products, compositions and methods are clearly within the 
scope of the invention, as described herein. 

25 

BACKGROUND TO THE INVENTION 

The biosynthesis of the starch granule is a complex process which involves the action 
of an array of isoforms of enzymes involved in the starch biosynthesis. Following the 
formation of glucose-1 -phosphate, the enzyme activities required for the synthesis of 
30 granular starch include ADP glucose pyrophosphorylase (EC 2.7.7.27), starch 
synthases (EC 2.4.1 .21), branching enzymes (EC 2.4.1 .18) and debranching enzymes 
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(EC 3.2.1 .41 and EC 3.2.1 .68) (Mouille et al., 1996). Plants contain isozymes of each 
of these activities, and the definition of these isoforms and their roles has been 
conducted through investigation of the properties of the suite of soluble enzymes found 
in the stroma of the plastid, analysis of the proteins entrapped within the matrix of the 
5 starch granule, and mutational studies to identify genes and define linkages between 
individual genes and their specific roles. , 

Starch synthases extend regions of a-1 ,4 glucan through the transfer of the glucosyl 
moiety of ADPglucose to the non-reducing end of a pre-existing a-1 ,4 glucan. In 

10 addition to GBSS, 3 other classes of starch synthase have been identified in plants, 
SSI (wheat, Li et al., 1999 and GenBank Accession No. U48227; rice, Baba ef a/., 
1993; potato, Genbank Accession No. STSTASYNT), SSII (pea, Dry et al. 1992; 
potato. Edwards et al., 1995; maize, Ham et al. 1998 and GenBank Accession No. 
U66377) and SSIII (potato, Abel etal, 1996; maize, Gao etal., 1998). In the cereals, 

15 the most comprehensively studied species is maize, where in addition to GBSS, 
cDNAs encoding SSI, SSIIa, and SSIIb have been isolated, and both cDNA and 
genomic clones for dulft have been characterised (Knight et al., 1998; Harn ef al., 
1998; Gao et al., 1998). In maize, the product of the du1 gene is known as maize 
SSII, however this gene is the homologue of potato SSIII. 

20 

The proteins within the matrix of the wheat starch granule have been extensively 
studied (Denyer ef al., 1995; Rahman ef al., 1995; Takaoka ef al., 1997; Yamamori 
and Endo, 1996) and 60, 75, 85, 100, 104 and 105 kDa protein bands can be 
visualised following SDS-PAGE. The predominant 60 kDa protein is exclusively 

25 granule-bound and is analogous to the "waxy" granule bound starch synthase (GBSS) 
gene in maize (Rahman ef al., 1995). The combination of three null alleles for this 
enzyme from each of the wheat genomes (Nakamura et al., 1995) results in the 
amylose-free "waxy" phenotype found in other species The 75 kDa starch synthase I 
(wSSI) is found in both the granule and the soluble fraction of wheat endosperm 

30 (Denyer et al., 1995; Li et al., 1999) and has been assigned to chromosomes 7A, 7B 
and 7D (Yamamori and Endo. 1996; Li ef al., 1999). The 85 kDa band contains a 
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class II branching enzyme and an unidentified polypeptide (Rahman et ai, 1995). The 
100, 104 and 105 kDa proteins of the wheat starch granule (designated Sgp-B1 , Sgp- 
D1 and Sgp-A1 by Yamamori and Endo, 1996) have been shown to be encoded by a 
homeologous set of genes on the short arm of chromosome 7B, 7A and 7D 
5 respectively (Yamamori and Endo, 1996; Takaoka et ai, 1997). Denyer et ai (1995) 
concluded on the basis of enzyme activity assays that these proteins were also starch 
synthases. These genes are referred to hereinafter as the "wheat SSI I genes". 

While GBSS has been established to be essential for amylose synthesis, the remaining 
10 starch synthases are thought to be primarily responsible for the elongation of 
amylopectin chains, although this does not preclude them from also having non- 
essential roles in amylose biosynthesis. Differences in kinetic properties between 
isoforms, and the analysis of mutants lacking various isoforms, suggests that each 
isoenzyme contributes to the extension of specific subsets of the available non- 
15 reducing ends. 

SUMMARY OF THE INVENTION 

The production of plants that produce improved starches that are modified for 
particular end-use applications, such as, for example, starches having high or low 
20 amylose:amylopectin ratios, requires the availability of genes encoding the various 
starch synthase isoforms. Because of species-specific codon usages, and variations 
in the kinetic parameters of the starch synthase isoforms between species, the 
production of modified starches may require the use of genes derived from particular 
species. 

25 

Furthermore, the screening-assisted breeding of plants having desirable starch content 
and/or composition requires specific gene sequences to be provided that can be used 
to distinguish between different homeologous genes encoding the various isoforms of 
wheat starch synthases, such as, for example, to identify and distinguish between 
30 naturally-occurring variant gene sequences. It is a particular object of the present 
invention to provide gene sequences to facilitate the screening-assisted selection of 
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wheat plants having starch traits which are associated with the presence and/or 
expression of one or more wheat SSI and/or SSIII genes. 

Accordingly, the present invention provides isolated nucleotide sequences encoding 
5 the wheat SSII (i.e. wSSII) and wheat SSIII (i.e. wSSIII) isoenzymes, and DNA markers 
derived therefrom. The present invention further facilitates the production of 
transformed plants carrying these nucleotide sequences. 

More particularly, the present invention provides isolated nucleic acid molecules 
10 encoding the 100, 104 and 105 kDa SSII (Sgp-1) polypeptides of the wheat starch 
granule matrix, as determined using the SDS/PAGE system of Rahman et al. (1995), 
which polypeptides are equivalent to the 100, 108 and 1 15 kDa polypeptides described 
by Yamamori and Endo (1996). 

15 The present invention further provides isolated nucleic acid molecules encoding the 
soluble du//1-type wheat starch synthase III polypeptide. Analysis of the polypeptides 
encoded by these nucleic acid molecules reveals several consensus amino acid 
sequence motifs that are highly conserved in wheat starch synthase isoenzymes, in 
addition to isoenzyme-specific sequences, which sequences possess utility in isolating 

20 related starch synthase-encoding sequences and in assaying plants for their 
expression of one or more starch synthase isoenzymes. 

Accordingly, one aspect of the present invention provides an isolated nucleic acid 
molecule which comprises a sequence of nucleotides which encodes, or is 
25 complementary to a nucleic acid molecule which encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof selected from 
the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
30 least about 85% identical overall to an amino acid sequence set forth in any one 

of SEQ ID NOS: 2, 4, or 6; 
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(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; 

5 (iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS (SEQ ID NO: 39); 

10 (b) GHTVEVILPKY (SEQ ID NO: 40); 

(c) HDWSSAPVAWLYKEHY (SEQ ID NO: 41 ); 

(d) GILNGIDPDIWDPYTD (SEQ ID NO: 42); 

(e) DVPIVGIITRLTAQKG (SEQ ID NO: 43); 

(f) NGQWLLGSA (SEQ ID NO: 44); 

15 (g)AGSDFIIVPSIFEPCGLTQLVAMRYGS (SEQ ID NO: 45); and 

(h)TGGLVDTV (SEQ ID NO: 46); 
wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 10; and 

20 (iv) a wheat starch synthase polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KTGGLGDVAGA (SEQ ID NO: 47); 

25 (b) GHRVMVWPRY (SEQ ID NO: 48); 

(c) NDWHTALLPVYLKAYY (SEQ ID NO: 49); 

(d) GIVNGIDNMEWNPEVD (SEQ ID NO: 50); 

(e) DVPLLGFIGRLDGQKG (SEQ ID NO: 51 ); 

(f) DVQLVMLGTG (SEQ ID NO: 52); 

30 (g)AGADALLMPSRF(E/V)PCGLNQLYAMAYGT (SEQ ID NO: 53); and 

(h)VGG(V/L)RDTV (SEQ ID NO: 54); 
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wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 10. 

5 In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS: 2,4, 6, 8 or 1 0, more preferably having 
at least about 95% or about 97% or about 99% identity to any one of said amino acid 
sequences. 

10 

In an alternative embodiment, the isolated nucleic acid molecule of the present 
invention encodes a wheat starch synthase polypeptide which comprises one or more 
amino acid sequences selected from the group consisting of: 
(a) GHTVEVlLPKY; 
15 (b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

(e) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(f) TGGLVDTV; 

20 (g) GIVNGIDNMEWNPEVD; and 

(h) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT. 

in an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
25 molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
one of SEQ ID NOS: 1, 3, 5, 7, 9,11-16. 37 or 38 or a complementary nucleotide 
sequence thereto. 

30 In a preferred embodiment, the isolated nucleic acid molecule comprises the 
nucleotide sequence set forth in any one of SEQ ID NOS: 1 , 3, 5, 7, 9,1 1-1 6, 37 or 38, 
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or is at least about 90% identical, more preferably at least about 95% or 97% or 99% 
identical to all or a protein-encoding part thereof. 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
5 molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 
from any one of SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37 or 38, or a complementary 
10 nucleotide sequence thereto. 

A second aspect of the present invention provides a method of isolating a nucleic acid 
molecule that encodes a starch synthase polypeptide, protein or enzyme described 
supra, said method comprising: 
15 (j) hybridising a probe or primer comprising at least about 15 contiguous 

nucleotides in length derived from any one of SEQ ID NOS: 1, 3, 5, 7, 9,1 1-16, 
37 or 38, or a complementary nucleotide sequence thereto to single-stranded 
or double-stranded mRNA, cDNA or genomic DNA; and 
(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
20 means. 

Preferably, the detecting means is a reporter molecule covalently attached to the probe 
or primer molecule or alternatively, a polymerase chain reaction format. Accordingly, 
the present invention clearly extends to the use of the nucleic acid molecules provided 
25 herein to isolate related starch synthase-encoding sequences using standard 
hybridisation and/or polymerase chain reaction techniques. 

A third aspect of the invention provides an isolated probe or primer comprising at least 
about 1 5 contiguous nucleotides in length derived from any one of SEQ ID NOS: 1 , 3, 
30 5, 7, 9,1 1-16, 37 or 38, or a complementary nucleotide sequence thereto. 
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Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 25 to 34. 

A fourth aspect of the present invention is directed to an isolated or recombinant starch 
5 synthase polypeptide, protein or enzyme, preferably substantially free of conspecific 
or non-specific proteins, which comprises an amino acid sequence selected from the 
following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 

1 0 least about 85% identical overall to an amino acid sequence set forth in any one 

of SEQ ID NOS: 2, 4, or 6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 

15 of SEQ ID NOS: 8 or 10; 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

20 (a) KVGGLGDWTS; . 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
25 (f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 

wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
30 sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 10; and 

(iv) a wheat starch synthase polypeptide, protein or enzyme or functional 



WO 00/66745 



PCT/AU00/00385 



-15- 

subunit thereof which comprises a conserved amino acid sequence having at 
least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KTGGLGDVAGA; 
5 (b) GHRVMVWPRY; 

(c) NDWHTALLPVYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

(f) DVQLVMLGTG; 

10 (g)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h)VGG(V/L)RDTV 

wherein said wheat starch synthase polypeptide further comprises an amino 
acid sequence having at least about 85% identity overall to an amino acid 
sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 1 0. 

15 

The present invention clearly encompasses the mature protein region of a wheat 
starch synthase polypeptide which is obtained by removal of the N-terminal transit 
peptide sequence. 

20 A further aspect of the invention provides a method of assaying for the presence or 
absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
in a plant, comprising contacting a biological sample derived from said plant with an 
isolated nucleic acid molecule derived from any one of SEQ ID NOS 1 , 3, 5, 7, 9,1 1 - 
16, 37 or 38, or any one of SEQ ID NOS: 25 to 34, or a complementary nucleotide 

25 sequence thereto for a time and under conditions sufficient for hybridisation to occur 
and then detecting said hybridisation using a detection means. 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

30 

A further aspect of the present invention utilises the above-mentioned assay method 
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in the breeding and/or selection of plants which express or do not express particular 
starch synthase isoenzymes or alternatively, which express a particular starch 
synthase isoenzyme at a particular level in one or more plant tissues. This aspect 
clearly extends to the selection of transformed plant material which contains one or 
5 more of the isolated nucleic acid molecules of the present invention. 

A further aspect of the present invention provides a method of modifying the starch 
content and/or starch composition of one or more tissues or organs of a plant, 
comprising expressing therein a sense molecule, antisense molecule, ribozyme 

10 molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1 , 3, 5, 7, 
9,11-16, 37 or 38, or a complementary nucleotide sequence thereto for a time and 
under conditions sufficient for the enzyme activity of one or more starch synthase 
isoenzymes to be modified. This aspect of the invention clearly extends to the 

15 introduction of the sense molecule, antisense molecule, ribozyme molecule, co- 
suppression molecule, or gene-targeting molecule to isolated plant cells, tissues or 
organs or organelles by cell fusion or transgenic means and the regeneration of intact 
plants therefrom. 

20 A further aspect of the present invention provides an isolated promoter that is operable 
in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. For example, the 
HMG promoter from wheat, or the maize zein gene promoter are particularly preferred, 
as is the promoter derived from a starch synthase gene of the present invention, such 

25 as a promoter that is linked in vivo to any one of SEQ ID NOS 1 , 3, 5, 7, 9,1 1-16, 37 
or 38, or a complementary nucleotide sequence thereto. 

A still further aspect of the present invention contemplates a transgenic plant 
comprising an introduced sense molecule, antisense molecule, ribozyme molecule, co- 
30 suppression molecule, or gene-targeting molecule having at least about 85% 
nucleotide sequence identity to any one of any one of SEQ ID NOS: 1 , 3, 5, 7, 9,1 1 -1 6, 
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37 or 38, or a complementary nucleotide sequence thereto or a genetic construct 
comprising same, and to plant propagules, cells, tissues, organs or plant parts derived 
from said transgenic plant that also carry the introduced molecule(s). 

5 BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a copy of a photographic representation showing the distribution of wheat 
endosperm starch synthases between the starch granule and soluble fractions. Lane 
1 , SDS-PAGE of wheat endosperm starch granule proteins revealed by silver staining; 
lanes 2-7, immunoblot of wheat endosperm soluble phase and starch granule proteins 

10 separated by SDS-PAGE from various developmental stages and probed with an anti- 
(wheat wSSII peptide) monoclonal antibody. Lanes 2-4 contain proteins from the 
soluble fraction of wheat endosperm at 15 days post anthesis (Lane 2); 20 days post 
anthesis (Lane 3); and at 25 days post anthesis (Lane 4). Lanes 5-7 contain proteins 
from the starch granule of wheat endosperm at 15 days post anthesis (Lane 5); 20 

15 days post anthesis (Lane 6); and at 25 days post anthesis (Lane 7). 

Figure 2 is a copy of a schematic representation comparing the nucleotide sequences 
of cDNA clones designated wSSIIA, wSSIIB and wSSIID, encoding the starch 
synthase II polypeptides from wheat, using the PILEUP programme of Devereaux et 
20 a/. (1984). 

Figure 3 is a copy of a schematic representation comparing the deduced amino acid 
sequences of starch synthase II from wheat (wSSIIA, wSSIIB and wSSIID), maize 
( ma j ze SSIIa and maize SSHb; Harn et a/., 1998), pea (pea SSII; Dry et a! 1992) and 

25 potato (potato SSII; van der Leij et a/., 1991). Identical amino acid residues among 
each of these sequences are indicated below the sequences with "*". The alignments 
of maize SSIIa with maize SSIIb, and pea SSII and potato SSII are essentially as 
described in Harn etai (1998) and Edwards etai (1995). All sequences are aligned 
to position the transit peptide cleavage site below the arrow (1) between residues 59 

30 and 60 of the wSSIIA sequence. The wSSIIpl sequence, the sequence of SGP-B1 
(peptide3), and of eight conserved regions are annotated and underlined. 
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Figure 4 is a copy of a photographic representation of a northern blot showing the 
expression of wheat wSSII mRNA in wheat plants. Total RNAs were isolated from 
leaves pre-anthesis florets and endosperm of the wheat cultivar "Gabo", grown under 
a photoperiod comprising 16 hours daylength, and at 18 C during the day, and at 

o 

5 13 C during the night cycle, and probed with the wSSIIp2 DNA fragment. The source 
of each RNA is indicated at the top of the Figure as follows: Lane 1 , leaf; Lane 2, pre- 
anthesis florets; Lanes 3-11, endosperm at: 4 days post-anthesis (Lane 3); 6 days 
post-anthesis (Lane 4); 8 days post-anthesis (Lane 5); 10 days post-anthesis (Lane 
6);12 days post-anthesis (Lane 7); 15 days post-anthesis (Lane 8); 18 days post- 
10 anthesis (Lane 9); 21 days post-anthesis (Lane 10); and 25 days post-anthesis (Lane 
11). 

Figure 5 is a copy of a photographic representation showing the localization of wheat 
starch synthase II genes on the wheat genome by PCR, using the primers ssllc, sslld 

15 and sslle in the amplification reaction. The nullisomic-tetrasomic genomic DNA of 
wheat cv. Chinese Spring was used as template DNA. Lane D, Triticum tauschii; Lane 
AB, Accession line N7DT7B having no 7D chromosome and four copies of the 7B 
chromosome; Lane AD, Accession line N7BT7A having no 7B chromosome and four 
copies of the 7A chromosome; Lane BD, Accession line N7AT7B having no 7A 

20 chromosome and four copies of the 7B chromosome; Lane ABD, wheat cv. Chinese 
Spring. PCR products derived from each cDNA clone are labelled. The results indicate 
that the cDNA clones, wSSIIB, wSSIIA and wSSIID are derived from the B-, A- and D- 
genomes of wheat, respectively. 

25 Figure 6 is a schematic representation showing the organisation of introns (lines) and 
exons (boxes) in the wheat SSII gene shown in SEQ ID NO: 37. The scale (bases), 
relative to the nucleotide sequence set forth in SEQ ID NO: 37, is provided at the 
bottom of the figure. 

30 Figure 7 is a schematic representation comparing the deduced amino acid Sequences 
of the maize, potato and wheat SSIII polypeptides. 
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Figure 8 is a copy of a photographic representation showing the expression of wheat 
wSSIII mRNA in wheat. Total RNAs were isolated from the endosperm of the wheat 
cultivars Wyuna (Panel a) and Gabo (Panel b) leaves pre-anthesis florets and 
endosperm of the wheat cultivar "Gabo", grown under a photoperiod comprising 16 

o O 

5 hours daylength, and at 18 C during the day cycle, and at 13 C during the night cycle, 
and probed with the wSSIIIpl DNA fragment derived from wSSIII.B3 cDNA. The 
source of each RNA is indicated at the top of the Figure as follows: Lane 1 , endosperm 
at: 4 days post-anthesis; Lane 2, endosperm at 6 days post-anthesis; Lane 4, 
endosperm at 8 days post-anthesis; Lane 4, endosperm at 10 days post-anthesis; 
10 Lane 5, endosperm at 12 days post-anthesis; Lane 6, endosperm at 15 days post- 
anthesis; Lane 7, endosperm at 18 days post-anthesis; Lane 8, endosperm at 21 days 
post-anthesis; Lane 9, endosperm at 25 days post-anthesis; and Lane 10, endosperm 
at 31 days post-anthesis (Panel a only). In panel (c), L refers to leaf RNA. and P refers 
to RNA from pre-anthesis florets derived from the cultivar Gabo. 

15 

Figure 9 is a schematic representation showing the position of conserved amino acid 
sequences within four wheat starch synthase proteins. The eight highly-conserved 
regions between the wheat starch synthase polypeptides are underlined and annotated 
at the top of each group of amino acid sequences. The sequences included in the 
20 alignment are the wheat SSM-A1 and wheat SSIII polypeptides of the present 
invention; wheat GBSS (wGBSS; Yan etai, 1999); wheat SSI (wSS1; Li et a/., 1999); 
wheat SSII (wSS2; SEQ ID NO: 4); and wheat SSIII (wSS3; SEQ ID NO: 8). 

Figure 10 is a schematic representation showing the relationships between the 
25 primary amino acid sequences of starch synthases (SS) and glycogen synthase of £. 
coli (GS). The dendrogram was generated by the program PILEUP (Devereaux etai, 
1984). The amino acid sequences used for the analysis are those of the wheat SSIIA, 
wheat SSIIB, wheat SSIID, and wheat SSIII polypeptides of the present invention 
compared to the deduced amino acid sequences of wheat GBSS (Clark et a/., 1991 ), 
30 wheat SSI (Li etai., 1999), rice GBSS (Okagaki, 1992), rice SSI (Baba etai, 1993), 
maize GBSS (Kloesgen etai, 1986), maize SSI (Knight etai, 1998), maize SSIIa and 
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maize SSIIb (Ham et ai, 1998), maize SSIII (Gao etal., 1998), pea GBSS (Dry era/.. 
1992), pea SSII (Dry etal., 1992), potato GBSS (van derLeij etal., 1991), potato SSI 
(Genbank accession number: STSTASYNT), potato SSII (Edwards et ai, 1995), potato 
SSIII (Abel et ai, 1996), and E. coli glycogen synthase (GS) (Kumar et ai, 1986). Five 
5 groups of enzymes included in the alignment are granule-bound starch synthase 
(GBSS), starch synthase-l (SSI), starch synthase-ll (SSII), starch synthase-lll (SSIII) 
and glycogen synthase (GS). 

Figure 11 is a schematic representation showing the position of conserved regions 
10 within cereal starch synthase genes. Comparisons of cereal starch synthases were 
made based on their deduced amino acid sequences and 8 conserved regions 
identified. Conserved regions are shown in bold and transit peptides (where defined) 
in grey. The sequences included in the alignment are the wheat SSII-A1 and wheat 
SSIII polypeptides of the present invention; wheat GBSS (Ainsworth et ai, 1993); 
15 wheat SSI (Li et ai, 1999); maize SSIIa (Ham et ai, 1998); and maize dull-1(Gao et 
ai, 1998). 

Figure 12 is a copy of a schematic representation of a gene map showing the 
alignment of fragments 1 to 6 of the genomic SSIII gene (lower line) with the 
20 corresponding SSIII cDNA clone (upper line). Raised regions in the genomic clone 
fragments (lower line) represent protein-encoding regions of the gene. 

Figure 13 is a schematic representation showing the organisation of introns (lines) and 
exons (boxes) in the wheat SSIII gene shown in SEQ ID NO: 38. The scale (bases), 
25 relative to the nucleotide sequence set forth in SEQ ID NO: 38, is provided at the 
bottom of the figure. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

30 One aspect of the present invention provides an isolated nucleic acid molecule which 
comprises a sequence of nucleotides which encodes, or is complementary to a nucleic 
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acid molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof selected from the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence set forth 

5 in any one of SEQ ID NOS:2,4, Or 6; and 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence set forth 
in any one of SEQ ID NOS: 8 or 10. 

10 Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 1,3, 5, or 37. 

15 Alternatively or in addition, the isolated nucleic acid molecule of the present invention 
encodes a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof and comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 7, 9, or 38. 

20 As used herein, the term "starch synthase" shall be taken to refer to any enzymatically- 
active peptide, polypeptide, oligopeptide, polypeptide, protein or enzyme molecule that 
is at least capable of transferring a glucosyl moiety from ADP-glucose to an a-1 ,4- 
glucan molecule, or a peptide, polypeptide, oligopeptide or polypeptide fragment of 
such an enzymatically-active molecule. 

25 

The term "wheat starch synthase" refers to a starch synthase derived from hexaploid 
wheat or barley or a progenitor species, or a relative thereto such as the diploid 
Thticum tauschii or other diploid, tetraploid, aneuploid, polyploid, nullisomic, or a 
wheat/barley addition line, amongst others, the only requirement that the genomic DNA 
30 is at least about 80% identical to the genome of a wheat plant as determined by 
standard DNA melting curve analyses. 
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The term "starch synthase II" or "wSSII" or similar term shall be taken to refer to a 
starch synthase as hereinbefore defined that is detectable in the starch granule of a 
plant seed endosperm and possesses one or more properties selected from the group 
consisting of: 

5 (i) it is immunologically cross-reactive with the wheat starch granule 

proteins designated Sgp-B1 and/or Sgp-D1 and/or Sgp-A1 , having estimated 

molecular weights of about 85 kDa to about 115 kDa; 

(ii) it is encoded by one of a homeologous set of genes localised on wheat 

chromosomes 7B or 7A or 7D; 
10 (iii) it is encoded by a nucleotide sequence that comprises at least about 1 5 

nucleotides in length derived from any one or more of SEQ ID NOS: 1 , 3, 5, or 

37 or a complementary nucleotide sequence thereto; 

(iv) it is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 

15 1,3, 5, or 37, or a complementary nucleotide sequence thereto; 

(v) it comprises an amino acid sequence having at least about 85% identity 
to one or more of SEQ ID NOS: 2 or 4 or 6; 

(vi) it comprises at least about 5 contiguous amino acids, preferably at least 
about 1 0 contiguous amino acids, more preferably at least about 1 5 contiguous 

20 amino acids, even more preferably at least about 20 contiguous amino acids 
and still even more preferably at least about 25-50 contiguous amino acids of 
the amino acid sequences set forth in SEQ ID NOS: 2 or 4 or 6; 

(vii) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 

25 (a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
30 (f) NGQWLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 
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(h)TGGLVDTV, 
in addition to any one or more of (i) to (vi); and 

(viii) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 
5 (a) KTGGLGDVAGA; 

(b) GHRVMVWPRY; 

(c) N D WHTALLP VYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 
10 (f) DVQLVMLGTG; 

(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h) VGG(V/L)RDTV. 

in addition to any one or more of (i) to (vi). 

15 The term "starch synthase III" or "wSSIII" or similar term shall be taken to refer to a 
starch synthase as hereinbefore defined that possesses one or more properties 
selected from the group consisting of: 

(i) it is encoded by a nucleotide sequence that comprises at least about 1 5 
nucleotides in length derived from any one or more of SEQ ID NOS: 7, 9, 1 1- 

20 1 6, or 38, or a complementary nucleotide sequence thereto; 

(ii) it is encoded by a nucleotide sequence that is at least about 85% 
identical to one or more of the nucleotide sequences set forth in SEQ ID NOS: 
7, 9, 11-16, or 38, or a complementary nucleotide sequence thereto; and 

(iii) it comprises an amino acid sequence having at least about 85% identity 
25 to one or more of SEQ ID NOS: 8 or 10; 

(iv) it comprises at least about 5 contiguous amino acids, preferably at least 
about 10 contiguous amino acids, more preferably at least about 15 contiguous 
amino acids, even more preferably at least about 20 contiguous amino acids 
and still even more preferably at least about 25-50 contiguous amino acids of 

30 the amino acid sequences set forth in SEQ ID NOS: 8 or 10; 

(v) which comprises a conserved amino acid sequence having at least 25% 
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5 



identity to an amino acid sequence selected from the group consisting of: 

(a) KVGGLGDVVTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 



10 



in addition to any one or more of (i) to (iv); and 



(vi) it which comprises a conserved amino acid sequence having at least 
25% identity to an amino acid sequence selected from the group consisting of: 

(a) KTGGLGDVAGA; 

(b) GHRVMVWPRY; 



in addition to any one or more of (i) to (iv). 

In a more preferred embodiment, the WSSII or WSSIII polypeptide encoded by the 
nucleic acid molecule of the present invention will comprise a substantial contiguous 
25 region of any one of SEQ ID NOS: 2, 4, 6, 8 or 10 or 17 sufficient to possess the 
biological activity of a starch synthase polypeptide. 

For the purposes of nomenclature, the nucleotide sequence set forth in SEQ ID NO: 
1 relates to the cDNA molecule encoding the WSSII (i.e. Sgp-B1) polypeptide of 
30 wheat. The amino acid sequence of the corresponding polypeptide is set forth herein 
as SEQ ID NO:2. The nucleotide sequence set forth in SEQ ID NO: 3 relates to the 



15 



(c) NDWHTALLPVYLKAYY; 

(d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

(f) DVQLVMLGTG; 



20 



(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h) VGG(V/L)RDTV, 
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cDNA molecule encoding the WSSII (i.e. Sgp-A1) polypeptide of wheat. The amino 
acid sequence of the corresponding polypeptide is set forth herein as SEQ ID NO:4. 
The nucleotide sequence set forth in SEQ ID NO: 5 relates to the cDNA molecule 
encoding the WSSII (i.e. Sgp-D1) polypeptide of wheat. The amino acid sequence of 
5 the corresponding polypeptide is set forth herein as SEQ ID NO:6. The nucleotide 
sequences set forth in SEQ ID NOs: 7 and 9 relate, respectively, to full-length and 
partial cDNA molecules encoding the WSSIII polypeptide of wheat. The amino acid 
sequences of the corresponding polypeptides are set forth herein as SEQ ID NOS: 8 
and 10, respectively. The nucleotide sequences set forth in SEQ ID NOs: 11 to 16 

10 relates to fragments of the genomic gene encoding the WSSIII polypeptide of wheat, 
significant protein-encoding regions of which are described by reference to Table 4 
and Figure 1 1 . The nucleotide sequence set forth in SEQ ID NO: 37 relates to the 
WSSII genomic gene of Triticum tauschii, corresponding to the WSSII gene of the D- 
genome of wheat, which encodes the WSSIII polypeptide. The nucleotide sequence 

15 set forth in SEQ ID NO: 38 relates to the wheat WSSIII genomic gene. 

Preferably, the isolated nucleic acid molecule of the present invention comprises a 
sequence of nucleotides which encodes, or is complementary to a nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 

20 molecule or a functional subunit thereof which comprises an amino acid sequence 
which is at least about 85% identical overall to an amino acid sequence set forth in any 
one of SEQ ID NOS: 2, 4, 6, 8, or 10 and more preferably, which additionally 
comprises which comprises one or more amino acid sequences selected from the 
group consisting of: 

25 (a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPI VG I ITRLTAQKG ; 
30 (f) NGQWLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; 
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(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
5 (I) GIVNGIDNMEWNPEVD; 

(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(E/V)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

10 

The present invention clearly extends to homologues, analogues and derivatives of the 
wheat starch synthase II and III genes exemplified by the nucleotide sequences set 
forth herein as SEQ ID NOs: 1 , 3, 5, 7, 9,1 1-16, 37 or 38. 

15 Preferred starch synthase genes may be derived from a naturally-occurring starch 
synthase gene by standard recombinant techniques. Generally, a starch synthase 
gene may be subjected to mutagenesis to produce single or multiple nucleotide 
substitutions, deletions and/or additions. Nucleotide insertional derivatives of the 
starch synthase gene of the present invention include 5' and 3' terminal fusions as 

20 well as intra-sequence insertions of single or multiple nucleotides. Insertional 
nucleotide sequence variants are those in which one or more nucleotides are 
introduced into a predetermined site in the nucleotide sequence although random 
insertion is also possible with suitable screening of the resulting product. Deletional 
variants are characterised by the removal of one or more nucleotides from the 

25 sequence. Substitutional nucleotide variants are those in which at least one nucleotide 
in the sequence has been removed and a different nucleotide inserted in its place. 
Such a substitution may be "silent" in that the substitution does not change the amino 
acid defined by the codon. Alternatively, substituents are designed to alter one amino 
acid for another similar acting amino acid, or amino acid of like charge, polarity, or 

30 hydrophobicity. 



For the present purpose, "homologues" of a nucleotide sequence shall be taken to 
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refer to an isolated nucleic acid molecule which is substantially the same as the nucleic 
acid molecule of the present invention or its complementary nucleotide sequence, 
notwithstanding the occurrence within said sequence, of one or more nucleotide 
substitutions, insertions, deletions, or rearrangements. 

5 

"Analogues" of a nucleotide sequence set forth herein shall be taken to refer to an 
isolated nucleic acid molecule which is substantially the same as a nucleic acid 
molecule of the present invention or its complementary nucleotide sequence, 
notwithstanding the occurrence of any non-nucleotide constituents not normally 
10 present in said isolated nucleic acid molecule, for example carbohydrates, 
radiochemicals including radionucleotides, reporter molecules such as, but not limited 
to DIG, alkaline phosphatase or horseradish peroxidase, amongst others. 

"Derivatives" of a nucleotide sequence set forth herein shall be taken to refer to any 
15 isolated nucleic acid molecule which contains significant sequence similarity to said 
sequence or a part thereof. Generally, the nucleotide sequence of the present 
invention may be subjected to mutagenesis to produce single or multiple nucleotide 
substitutions, deletions and/or insertions. Nucleotide insertional derivatives of the 
nucleotide sequence of the present invention include 5' and 3' terminal fusions as well 
20 as intra-sequence insertions of single or multiple nucleotides or nucleotide analogues. 
Insertional nucleotide sequence variants are those in which one or more nucleotides 
or nucleotide analogues are introduced into a predetermined site in the nucleotide 
sequence of said sequence, although random insertion is also possible with suitable 
screening of the resulting product being performed. Deletional variants are 
25 characterised by the removal of one or more nucleotides from the nucleotide 
sequence. Substitutional nucleotide variants are those in which at least one nucleotide 
in the sequence has been removed and a different nucleotide or nucleotide analogue 
inserted in its place. 

30 The present invention extends to the isolated nucleic acid molecule when integrated 
into the genome of a cell as an addition to the endogenous cellular complement of 
starch synthase genes, irrespective of whether or not the introduced nucleotide 
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sequence is translatable or non-translatable to produce a polypeptide. The present 
invention clearly contemplates the introduction of additional copies of starch synthase 
genes into plants, particularly wheat plants, in the antisense orientation to reduce the 
expression of particular wheat starch synthase genes. As will be known to those skilled 
5 in the art, such antisense genes are non-translatable, notwithstanding that they can be 
expressed to produce antisense mRNA molecules. 

The said integrated nucleic acid molecule may, or may not, contain promoter 
sequences to regulate expression of the subject genetic sequence. 

10 

Accordingly, the present invention clearly encompasses preferred homologues, 
analogues and derivatives that comprise a sequence of nucleotides which encodes, 
or is complementary to a nucleic acid molecule which encodes a wheat starch 
synthase polypeptide, protein or enzyme molecule or a functional subunit thereof 
15 selected from the following: 

(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 2, 4, or 6; 
20 (ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 

functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
or SEQ ID NOS: 8 or 10; 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
25 subunit thereof which comprises a conserved amino acid sequence having at 

least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

30 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 
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(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV 

5 and wherein said wheat starch synthase polypeptide further comprises an 
amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ. ID NOS: 2, 4, 6, 8 or 10; and 
(iv) a wheat starch synthase polypeptide, protein or enzyme or functional 
subunit thereof which comprises a conserved amino acid sequence having at 

10 least 25% identity to an amino acid sequence selected from the group 

consisting of: 

(a) KTGGLGDVAGA; 

(b) GHRVMWVPRY; 

(c) NDWHTALLPVYLKAYY; 
15 (d) GIVNGIDNMEWNPEVD; 

(e) DVPLLGFIGRLDGQKG; 

(f) DVQLVMLGTG; 

(g) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 

(h) VGG(V/L)RDTV, 

20 and wherein said wheat starch synthase polypeptide further comprises an 

amino acid sequence having at least about 85% identity overall to an amino 
acid sequence set forth in any one of SEQ ID NOS: 2, 4, 6, 8 or 10. 

Preferably, the isolated nucleic acid molecule encodes a starch synthase polypeptide, 
25 protein or enzyme that comprises two, more preferably three, more preferably four, 
more preferably five, more preferably six, more preferably seven and even more 
preferably eight of the conserved amino acid motifs listed supra. Even more preferably, 
the said amino acid motifs are located in a relative configuration such as that shown 
for the wheat SSII or wheat SSIII polypeptides described herein. 

30 

In a preferred embodiment, the isolated nucleic acid molecule encodes a starch 
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synthase polypeptide, protein or enzyme having at least about 90% amino acid 
sequence identity to any one of SEQ ID NOS: 2, 4, 6, 8 or 10, more preferably having 
at least about 95% or about 97% or about 99% identity to any one of said amino acid 
sequences. 

5 

In an alternative embodiment, the present invention provides an isolated nucleic acid 
molecule which encodes a wheat starch synthase polypeptide, protein or enzyme 
molecule or a functional subunit thereof, wherein said nucleic acid molecule comprises 
a nucleotide sequence having at least about 85% nucleotide sequence identity to any 
10 one of SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37, or 38, or a degenerate nucleotide 
sequence thereto or a complementary nucleotide sequence thereto. 

By "degenerate nucleotide sequence" is meant a nucleotide sequence that encodes 
a substantially identical amino acid sequence as a stated nucleotide sequence. 

15 

In a preferred embodiment, the isolated nucleic acid molecule comprises the 
nucleotide sequence set forth in anyone of SEQ ID NOS: 1,3, 5, 7, 9,11-16, 37, or 38, 
or is at least about 90% identical, more preferably at least about 95% or 97% or 99% 
identical to all or a protein-encoding part thereof. 

20 

In an alternative embodiment, preferred homologues, analogues and derivatives of the 
nucleic acid molecule of the present invention encodes a wheat starch synthase 
polypeptide, protein or enzyme molecule or a functional subunit thereof and comprises 
a nucleotide sequence that is capable of hybridising under at least moderate 
25 stringency hybridisation conditions to at least about 30 contiguous nucleotides derived 
from any one of SEQ ID NOS: 1, 3, 5 f 7, 9,11-16, 37, or 38, or a complementary 
nucleotide sequence thereto. 

For the purposes of defining the level of stringency, a low stringency is defined herein 
30 as being a hybridisation and/or a wash carried out in 6xSSC buffer, 0.1% (w/v) SDS 
at 28 °C. Generally, the stringency is increased by reducing the concentration of SSC 
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buffer, and/or increasing the concentration of SDS and/or increasing the temperature 
of the hybridisation and/or wash. A moderate stringency comprises a hybridisation 
and/or a wash carried out in 0.2 x SSC-2 x SSC buffer, 0.1% (w/v) SDS at 42°C to 
65°C, while a high stringency comprises a hybridisation and/or a wash carried out in 
5 0.1xSSC-0.2 x SSC buffer, 0.1% (w/v) SDS at a temperature of at least 55°C. 
Conditions for hybridisations and washes are well understood by one normally skilled 
in the art. For the purposes of further clarification only, reference to the parameters 
affecting hybridisation between nucleic acid molecules is found in pages 2.10.8 to 
2.10.16. of Ausubel etal. (1987), which is herein incorporated by reference. 

10 

Those skilled in the art will be aware of procedures for the isolation of further wheat 
starch synthase genes to those specifically described herein or homologues, 
analogues or derivatives of said genes, for example further cDNA sequences and 
genomic gene equivalents, when provided with one or more of the nucleotide 

15 sequences set forth in SEQ ID NOs: 1, 3, 5, 7, 9,11-16, 37, or 38. In particular, 
amplifications and/or hybridisations may be performed using one or more nucleic acid 
primers or hybridisation probes comprising at least 10 contiguous nucleotides and 
preferably at least about 20 contiguous nucleotides or 50 contiguous nucleotides 
derived from the nucleotide sequences set forth herein, to isolate cDNA clones, mRNA 

20 molecules, genomic clones from a genomic library (in particular genomic clones 
containing the entire 5' upstream region of the gene including the promoter sequence, 
and the entire coding region and 3'-untranslated sequences), and/or synthetic 
oligonucleotide molecules, amongst others. The present invention clearly extends to 
such related sequences. 

25 

Accordingly, a second aspect of the present invention provides a method of isolating 
a nucleic acid molecule that encodes a starch synthase polypeptide, protein or enzyme 
said method comprising: 

(i) hybridising a probe or primer comprising at least about 1 5 contiguous 
30 nucleotides in length derived from any one of SEQ ID NOS 1 , 3, 5, 7, 9,1 1-1 6, 
37, or 38, or a complementary nucleotide sequence thereto to single-stranded 
or double-stranded mRNA, cDNA or genomic DNA; and 
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(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

Preferably, the detecting means is a reporter molecule covalently attached to the probe 
5 or primer molecule or alternatively, a polymerase chain reaction format. 

An alternative method contemplated in the present invention involves hybridising two 
nucleic acid "primer molecules" to a nucleic acid "template molecule" which comprises 
a related starch synthase gene or related starch synthase genetic sequence or a 

10 functional part thereof, wherein the first of said primers comprises contiguous 
nucleotides derived from any one or more of SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37, or 
38, and the second of said primers comprises contiguous nucleotides complementary 
to anyoneormoreof SEQIDNOS: 1,3, 5, 7, 9,11-16, 37, or 38. Specific nucleic 
acid molecule copies of the template molecule are amplified enzymatically in a 

15 polymerase chain reaction, a technique that is well known to one skilled in the art. 

In a preferred embodiment, each nucleic acid primer molecule is at least 10 
nucleotides in length, more preferably at least 20 nucleotides in length, even more 
preferably at least 30 nucleotides in length, still more preferably at least 40 nucleotides 
20 in length and even still more preferably at least 50 nucleotides in length. 

Furthermore, the nucleic acid primer molecules consists of a combination of any of the 
nucleotides adenine, cytidine, guanine, thymidine, or inosine, or functional analogues 
or derivatives thereof which are at least capable of being incorporated into a 
25 polynucleotide molecule without having an inhibitory effect on the hybridisation of said 
primer to the template molecule in the environment in which it is used. 

Furthermore, one or both of the nucleic acid primer molecules may be contained in an 
aqueous mixture of other nucleic acid primer molecules, for example a mixture of 
30 degenerate primer sequences which vary from each other by one or more nucleotide 
substitutions or deletions. Alternatively, one or both of the nucleic acid primer 
molecules may be in a substantially pure form. 
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The nucleic acid template molecule may be in a recombinant form, in a virus particle, 
bacteriophage particle, yeast cell, animal cell, or a plant cell. Preferably, the nucleic 
acid template molecule is derived from a plant cell, tissue or organ, in particular a cell, 
tissue or organ derived from a wheat or barley plant or a progenitor species, or a 
5 relative thereto such as the diploid Triticum tauschii or other diploid, tetraploid, 
aneuploid, polyploid, nullisomic, or a wheat/barley addition line, amongst others. 

Those skilled in the art will be aware that there are many known variations of the basic 
polymerase chain reaction procedure, which may be employed to isolate a related 
10 starch synthase gene or related starch synthase genetic sequence when provided with 
the nucleotide sequences set forth herein. Such variations are discussed, for example, 
in McPherson et al (1991). The present invention extends to the use of all such 
variations in the isolation of related starch synthase genes or related starch synthase 
genetic sequences using the nucleotide sequences embodied by the present invention. 

15 

As exemplified herein, the present inventors have isolated several wheat starch 
synthase genes using both hybridisation and polymerase chain reaction approaches, 
employing novel probes and primer sequences to do so. 

20 Accordingly, a third aspect of the invention provides an isolated probe or primer 
comprising at least about 15 contiguous nucleotides in length derived from any one of 
SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37, or 38, or a complementary nucleotide sequence 
thereto. 

25 Preferably, the probe or primer comprises a nucleotide sequence set forth in any one 
of SEQ ID NOS: 25 to 34. 

The isolated nucleic acid molecule of the present invention may be introduced into and 
expressed in any cell, for example a plant cell, fungal cell, insect cell, animal cell, yeast 
30 cell or bacterial cell. Those skilled in the art will be aware of any modifications which 
are required to the codon usage or promoter sequences or other regulatory 
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sequences, in order for expression to occur in such cells. 

A further aspect of the invention provides a method of assaying for the presence or 
absence of a starch synthase isoenzyme or the copy number of a gene encoding same 
5 in a plant, comprising contacting a biological sample derived from said plant with an 
isolated nucleic acid molecule derived from anyone of SEQ ID NOS 1, 3, 5, 7, 9,11- 
16, 37, or 38, or any one of SEQ ID NOS: 25 to 34, or a complementary nucleotide 
sequence thereto for a time and under conditions sufficient for hybridisation to occur 
and then detecting said hybridisation using a detection means. 

10 

The detection means according to this aspect of the invention is any nucleic acid 
based hybridisation or amplification reaction. 

The hexaploid nature of wheat prevents the straightforward identification of starch 
15 synthase allelic variants by hybridisation using the complete starch synthase-encoding 
sequence, because the similarities between the various alleles generally results in 
significant cross-hybridisation. Accordingly, sequence-specific hybridisation probes are 
required to distinguish between the various alleles. Similarly, wherein PCR is used to 
amplify specific allelic variants of a starch synthase gene, one or more sequence- 
20 specific amplification primers are generally required. As will be apparent from the 
amino acid sequence comparisons provided herein, such as in Figures 3 and 13, non- 
conserved regions of particular wheat starch synthase polypeptides are particularly 
useful for the design of probes and primers that are capable of distinguishing between 
one or more starch synthase polypeptide isoenzyme or allelic variant. The present 
25 invention clearly contemplates the design of such probes and primers based upon the 
sequence comparisons provided herein. 

In the performance of this embodiment of the present invention, the present inventors 
particularly contemplate the identification of wheat starch synthase null alleles or 
30 alternatively, mutations wherein specific amino acids are inserted or deleted or 
substituted, compared to one or more of the wheat SSII or SSIII alleles disclosed 
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herein. Such null alleles and other allelic variants are readily identifiable using PCR 
screening which employs amplification primers based upon the nucleotide and amino 
acid sequences disclosed herein for SSII and/or SSIII. Once identified, the various 
mutations can be stacked or pyramided into one or more new wheat lines, such as by 
5 introgression and/or standard plant breeding and/or recombinant approaches (eg. 
transformation, transfection, etc) thereby producing a novel germplasm which exhibits 
altered starch properties compared to existing lines. DNA markers based upon the 
nucleotide and amino acid sequences disclosed herein for SSII and/or SSIII can be 
employed to monitor the stacking of genes into the new lines and to correlate the 
10 presence of particular genes with starch phenotypes of said lines. 

In this regard, a significant advantage conferred by the present invention is the design 
of new DNA markers that reveal polymorphisms such as, for example, length 
polymorphisms, restriction site polymorphisms, and single nucleotide polymorphisms, 
15 amongst others, between wheat starch synthases and, in particular, between wheat 
GBSS and/or SSI and/or SSII and/or SSIII, or between allelic variants of one or more 
of said starch synthases, that can be used to identify the three genomes of hexaploid 
wheats (i.e., the A, B and D genomes). 

20 Preferably, such DNA markers are derived from the intron region of a starch synthase 
gene disclosed herein, more preferably the wheat SSII and/or the wheat SSIII gene. 
Those skilled in the art will be aware that such regions generally have a higher degree 
of variation than in the protein-encoding regions and, as a consequence, are 
particularly useful in identifying specific allelic variants of a particular gene, such as 

25 allelic variants contained in any one of the three wheat genomes, or alternatively or in 
addition, for the purpose of distinguishing between wheat GBSS, SSI, SSII or SSIII 
genes. 

A further approach contemplated by the present inventors is the design of unique 
30 isoenzyme-specific and/or allele-specific peptides based upon the amino acid 
sequence disclosed herein as SEQ ID NOS: 25 and/or SEQ ID NO: 4 and/or SEQ ID 
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NO: 6 and/or SEQ ID NO: 8 and/or SEQ ID NO: 10, which peptides are then used to 
produce polyclonal or monoclonal antibodies by conventional means. Alternatively, the 
genes encoding these polypeptides or unique peptide regions thereof can be 
introduced in an expressible format into an appropriate prokaryotic or eukaryotic 
5 expression system, where they can be expressed to produce the isoenzyme-specific 
and/or allele-specific peptides for antibody production. Such antibodies may also be 
used as markers for the purpose of both identifying parental lines and germplasms and 
monitoring the stacking of genes in new lines, using conventional immunoassays such 
as, for example, ELISA and western blotting. 

10 

A further aspect of the present invention utilises the above-mentioned nucleic acid 
based assay method in the breeding and/or selection of plants which express or do not 
express particular starch synthase isoenzymes or alternatively, which express a 
particular starch synthase isoenzyme at a particular level in one or more plant tissues. 
15 This aspect clearly extends to the selection of transformed plant material which 
contains one or more of the isolated nucleic acid molecules of the present invention. 

Yet another aspect of the present invention provides for the expression of the nucleic 
acid molecule of the present invention in a suitable host (e.g. a prokaryote or 
20 eukaryote) to produce full length or non-full length recombinant starch synthase gene 
products. 

Hereinafter the term "starch synthase gene product" shall be taken to refer to a 
recombinant product of a starch synthase gene of the present invention. 

25 

Preferably, the recombinant starch synthase gene product comprises an amino acid 
sequence having the catalytic activity of a starch synthase polypeptide or a functional 
mutant, derivative part, fragment, or analogue thereof. 

30 In a particularly preferred embodiment of the invention, the recombinant starch 
synthase gene product is selected from the following: 
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(i) a wheat starch synthase II (wSSII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
ofSEQIDNOS:2,4,or6; 

5 (ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 

functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; and 

(iii) a wheat starch synthase polypeptide, protein or enzyme or functional 
10 subunit thereof which comprises a conserved amino acid sequence having at 

least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

15 (c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 
(0 NGQWLLGSA; 

(g)AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

20 (h)TGGLVDTV; (i) a wheat starch synthase II (wSSII) 

polypeptide, protein or enzyme or functional subunit 
thereof which comprises an amino acid sequence 
which is at least about 85% identical overall to an 
amino acid sequence set forth in any one of SEQ 

25 ID NOS: 2, 4, or 6; 

(ii) a wheat starch synthase III (wSSIII) polypeptide, protein or enzyme or 
functional subunit thereof which comprises an amino acid sequence which is at 
least about 85% identical overall to an amino acid sequence set forth in any one 
of SEQ ID NOS: 8 or 10; 

30 (iii) a wheat starch synthase polypeptide, protein or enzyme or functional 

subunit thereof which comprises a conserved amino acid sequence having at 
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least 25% identity to an amino acid sequence selected from the group 
consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 
(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

20 Accordingly, the present invention clearly extends to homologues, analogues and 
derivatives of the amino acid sequences set forth herein as SEQ ID NOS: 2, 4, 6, 8 
and 10. 

In the present context, "homologues" of an amino acid sequence refer to those 
25 polypeptides, enzymes or proteins which have a similar catalytic activity to the amino 
acid sequences described herein, notwithstanding any amino acid substitutions, 
additions or deletions thereto. A homologue may be isolated or derived from the same 
or another plant species as the species from which the polypeptides of the invention 
are derived. 

30 

"Analogues" encompass polypeptides of the invention notwithstanding the occurrence 



10 
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of any non-naturally occurring amino acid analogues therein. 

"Derivatives" include modified peptides in which ligands are attached to one or more 
of the amino acid residues contained therein, such as carbohydrates, enzymes, 

5 proteins, polypeptides or reporter molecules such as radionuclides or fluorescent 
compounds. Glycosylated, fluorescent, acylated or alkylated forms of the subject 
peptides are particularly contemplated by the present invention. Additionally, 
derivatives of an amino acid sequence described herein which comprises fragments 
or parts of the subject amino acid sequences are within the scope of the invention, as 

10 are homopolymers or heteropolymers comprising two or more copies of the subject 
polypeptides. Procedures for derivatizing peptides are well-known in the art. 

Substitutions encompass amino acid alterations in which an amino acid is replaced 
with a different naturally-occurring or a non-conventional amino acid residue. Such 
15 substitutions may be classified as "conservative", in which an amino acid residue 
contained in a starch synthase gene product is replaced with another naturally- 
occurring amino acid of similar character, for example Gly<-Ala, Val<->lle~Leu, 
Asp<-*Glu, Lys<->Arg, Asn<-*Gln or Phe<->Trp<-»Tyr. 

20 Substitutions encompassed by the present invention may also be "non-conservative", 
in which an amino acid residue which is present in a starch synthase gene product 
described herein is substituted with an amino acid with different properties, such as a 
naturally-occurring amino acid from a different group (eg. substituted a charged or 
hydrophobic amino acid with alanine), or alternatively, in which a naturally-occurring 

25 amino acid is substituted with a non-conventional amino acid. 

Non-conventional amino acids encompassed by the invention include, but are not 
limited to those listed in Table 2. 

30 Amino acid substitutions are typically of single residues, but may be of multiple 
residues, either clustered or dispersed. 
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Amino acid deletions will usually be of the order of about 1-10 amino acid residues, 
while insertions may be of any length. Deletions and insertions may be made to the 
N-terminus, the C-terminus or be internal deletions or insertions. Generally, insertions 
within the amino acid sequence will be smaller than amino- or carboxy-terminal fusions 
5 and of the order of 1-4 amino acid residues, 

A homologue, analogue or derivative of a starch synthase gene product as referred to 
herein may readily be made using peptide synthetic techniques well-known in the art, 
such as solid phase peptide synthesis and the like, or by recombinant DNA 
10 manipulations. Techniques for making substituent mutations at pre-determined sites 
using recombinant DNA technology, for example by M13 mutagenesis, are also well- 
known. The manipulation of nucleic acid molecules to produce variant peptides, 
polypeptides or proteins which manifest as substitutions, insertions or deletions are 
well-known in the art. 

15 

The starch synthase gene products described herein may be derivatized further by the 
inclusion or attachment thereto of a protective group which prevents, inhibits or slows 
proteolytic or cellular degradative processes. Such derivatization may be useful where 
the half-life of the subject polypeptide is required to be extended, for example to 

20 increase the amount of starch produced in the endosperm or alternatively, to increase 
the amount of protein produced in a bacterial or eukaryotic expression system. 
Examples of chemical groups suitable for this purpose include, but are not limited to, 
any of the non-conventional amino acid residues listed in Table 2, in particular a D- 
stereoisomer or a methylated form of a naturally-occurring amino acid listed in Table 

25 1. Additional chemical groups which are useful for this purpose are selected from the 
list comprising aryl or heterocyclic N-acyl substituents, polyalkylene oxide moieties, 
desulphatohirudin muteins, alpha-muteins, alpha-aminophosphonic acids, water- 
soluble polymer groups such as polyethylene glycol attached to sugar residues using 
hydrazone or oxime groups, benzodiazepine dione derivatives, glycosyl groups such 

30 as beta-glycosylamine or a derivative thereof, isocyanate conjugated to a polyol 
functional group or polyoxyethylene polyol capped with diisocyanate, amongst others. 
Similarly, a starch synthase gene product or a homologue, analogue or derivative 
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thereof may be cross-linked or fused to itself or to a protease inhibitor peptide, to 
reduce susceptibility of said molecule to proteolysis. 

in a particularly preferred embodiment, the percentage similarity to in any one of SEQ 
5 ID NOS: 2, 4, 6, 8 or 10 is at least about 90%, more preferably at least about 95%, 
even more preferably at least about 97% and even more preferably at least about 
98%, or about 99% or 100%. 

In a related embodiment, the present invention provides a "sequencably pure" form of 
10 the amino acid sequence described herein. "Sequencably pure" is hereinbefore 
described as substantially homogeneous to facilitate amino acid determination. 

In a further related embodiment, the present invention provides a "substantially 
homogeneous" form of the subject amino acid sequence, wherein the term 

15 "substantially homogeneous" is hereinbefore defined as being in a form suitable for 
interaction with an immunologically interactive molecule. Preferably, the polypeptide 
is at least 20% homogeneous, more preferably at least 50% homogeneous, still more 
preferably at least 75% homogeneous and yet still more preferably at least about 95- 
100% homogenous, in terms of activity per microgram of total protein in the protein 

20 preparation. 

To produce the recombinant polypeptide of the present invention, the coding region 
of a starch synthase gene described herein or a functional homologue, analogue or 
derivative thereof is placed operably in connection with a promoter sequence in the 
25 sense orientation, such that a starch synthase gene product is capable of being 
expressed under the control of said promoter sequence. 

In the present context, the term "in operable connection with" means that expression 
of the isolated nucleotide sequence is under the control of the promoter sequence with 
30 which it is connected, regardless of the relative physical distance of the sequences 
from each other or their relative orientation with respect to each other. 
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Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA 
box which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e. upstream activating sequences, 
5 enhancers and silencers) which alter gene expression in response to developmental 
and/or external stimuli, or in a tissue-specific manner. A promoter is usually, but not 
necessarily, positioned upstream or 5', of a structural gene, the expression of which 
it regulates. Furthermore, the regulatory elements comprising a promoter are usually 
positioned within 2 kb of the start site of transcription of the gene. 

10 

In the present context, the term "promoter 11 is also used to describe a synthetic or 
fusion molecule, or derivative which confers, activates or enhances expression of a 
structural gene or other nucleic acid molecule, particularly in a plant cell and more 
preferably in a wheat plant or other monocotyledonous plant cell, tissue or organ. 
15 Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression and/or to alter the spatial expression and/or 
temporal expression. For example, regulatory elements which confer copper inducibility 
may be placed adjacent to a heterologous promoter sequence, thereby conferring 
copper inducibility on the expression of said molecule. 

20 

Those skilled in the art will be aware that in order to obtain optimum expression of the 
starch synthase gene of the present invention, it is necessary to position said gene in 
an appropriate configuration such that expression is controlled by the promoter 
sequence. Promoters are generally positioned 5' (upstream) to the genes that they 

25 control. In the construction of heterologous promoter/structural gene combinations it 
is generally preferred to position the promoter at a distance from the gene transcription 
start site that is approximately the same as the distance between that promoter and 
the gene it controls in its natural setting, i.e., the gene from which the promoter is 
derived. As is known in the art, some variation in this distance can be accommodated 

30 without loss of promoter function. Similarly, the preferred positioning of a regulatory 
sequence element with respect to a heterologous gene to be placed under its control 
is defined by the positioning of the element in its natural setting, i.e., the genes from 
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which it is derived. Again, as is known in the art, some variation in this distance can 
also occur. 

Examples of promoters suitable for expressing the starch synthase gene of the present 
5 invention include viral, fungal, bacterial, animal and plant derived promoters capable 
of functioning in prokaryotic or eukaryotic cells. Preferred promoters are those capable 
of regulating the expression of the subject starch synthase genes in plants cells, fungal 
cells, insect cells, yeast cells, animal cells or bacterial cells, amongst others. 
Particularly preferred promoters are capable of regulating expression of the subject 
10 nucleic acid molecules in monocotyledonous plant cells. The promoter may regulate 
the expression of the said molecule constitutively, or differentially with respect to the 
tissue in which expression occurs or, with respect to the developmental stage at which 
expression occurs, or in response to external stimuli such as physiological stresses, 
or plant pathogens, or metal ions, amongst others. 

15 

Accordingly, strong constitutive promoters are particularly preferred for the purposes 
of the present invention. 

Examples of preferred promoters include the bacteriophage T7 promoter, 
20 bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 
late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, CaMV 35S 
promoter, SCSV promoter, SCBV promoter and the like. 

Particularly pre/erred promoters operable in plant cells include, for example the CaMV 
25 35S promoter, and the SCBV promoter. Those skilled in ihe art will readily be aware 
of additional promoter sequences other than those specifically described. 

In a particularly preferred embodiment, the promoter may be derived from a genomic 
starch synthase gene. Preferably, the promoter sequence comprises nucleotide 
30 sequences that are linked in vivo to nucleotide sequences set forth in any one of SEQ 
ID NOs: 1, 3, 5, 7, 9,11-16, 37, or 38. By "linked in vivo" means that the promoter is 
present in its native state in the genome of a wheat plant where it controls expression 
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of the starch synthase gene of the present invention. 

Conveniently, genetic constructs are employed to facilitate expression of a starch 
synthase genetic sequence of the present invention or a functional derivative, part, 
5 homologue, or analogue thereof. To produce a genetic construct, the starch synthase 
gene of the invention is inserted into a suitable vector or episome molecule, such as 
a bacteriophage vector, viral vector or a plasmid, cosmid or artificial chromosome 
vector which is capable of being maintained and/or replicated and/or expressed in the 
host cell, tissue or organ into which it is subsequently introduced. The said genetic 
10 construct comprises the subject nucleic acid molecule placed operably under the 
control of a promoter sequence and optionally, a terminator sequence. 

The term "terminator refers to a DNA sequence at the end of a transcriptional unit 
which signals termination of transcription. Terminators are 3'-npn-translated DNA 
15 sequences containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3'-end of a primary transcript. Terminators active in 
bacteria, yeasts, animal cells and plant cells are known and described in the literature. 
They may be isolated from bacteria, fungi, viruses, animals and/or plants. 

20 Examples of terminators particularly suitable for use in expressing the nucleic acid 
molecule of the present invention in plant cells include the nopaline synthase (NOS) 
gene terminator of Agrobacterium tumefaciens, the terminator of the Cauliflower 
mosaic virus (CaMV) 35S gene, and the zein gene terminator from Zea mays. 

25 Genetic constructs will generally further comprise one or more origins of replication 
and/or selectable marker gene sequences. 

The origin of replication can be functional in a bacterial cell and comprise, for example, 
the pUC or the ColE1 origin. Alternatively, the origin of replication is operable in a 
30 eukaryotic cell, tissue and more preferably comprises the 2 micron (2jum) origin of 
replication or the SV40 origin of replication. 
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As used herein, the term "selectable marker gene" includes any gene which confers 
a phenotype on a cell in which it is expressed to facilitate the identification and/or 
selection of cells which are transfected or transformed with a genetic construct of the 
invention or a derivative thereof. 

5 

Suitable selectable marker genes contemplated herein include the ampicillin-resistance 
gene (AmpO, tetracycline-resistance gene (TV), bacterial kanamycin-resistance gene 
(Kan r ), is the zeocin resistance gene (Zeocin is a drug of bleomycin family which is 
trademark of InVitrogen Corporation), the AURI-C gene which confers resistance to the 

10 antibiotic aureobasidin A, phosphinothricin-resistance gene, neomycin 
phosphotransferase gene (nptW), hygromycin-resistance gene, p-glucuronidase (GUS) 
gene, chloramphenicol acetyltransferase (CAT) gene, green fluorescent protein- 
encoding gene or the luciferase gene, amongst others. Those skilled in the art will be 
aware of other selectable marker genes useful in the performance of the present 

1 5 invention and the subject invention is not limited by the nature of the selectable marker 
gene. 

Usually, an origin of replication or a selectable marker gene suitable for use in bacteria 
is physically-separated from those genetic sequences contained in the genetic 
20 construct which are intended to be expressed or transferred to a eukaryotic cell, or 
integrated into the genome of a eukaryotic cell. 

Standard methods can be used to introduce genetic constructs into a cell, tissue or 
organ for the purposes of modulating gene expression. Particularly preferred methods 

25 suited to the introduction of synthetic genes and genetic constructs comprising same 
to eukaryotic cells include liposome-mediated transfection or transformation, 
transformation of cells with attenuated virus particles or bacterial cells and standard 
procedures for the transformation of plant and animal cells, tissues, organs or 
organisms. Any standard means may be used for their introduction including cell 

30 mating, transformation or transfection procedures known to those skilled in the art or 
described by Ausubel et ai (1 992). 
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in a further embodiment of the present invention, the starch synthase genes of the 
present invention and genetic constructs comprising same are adapted for integration 
into the genome of a cell in which it is expressed. Those skilled in the art will be aware 
that, in order to achieve integration of a genetic sequence or genetic construct into the 
5 genome of a host cell, certain additional genetic sequences may be required. In the 
case of plants, left and right border sequences from the T-DNA of the Agrobacterium 
tumefaciens Ti plasmid will generally be required. 

The invention further contemplates increased starch and/or modified starch 
10 composition in transgenic plants expressing the nucleic acid molecule of the invention 
in the sense orientation such that the activity of one or more starch synthase 
isoenzymes is increased therein. By increasing the level of one or more starch 
synthase isoenzymes, the deposition of starch in the amyloplast or chloroplast is 
increased and/or a modified starch granule structure is produced and/or starch 
15 composition is modified and/or the amylose/amylopectin ratio is altered in the plant. 

Wherein it is desired to increase the synthesis of a particular starch synthase 
isoenzyme in a plant cell, the coding region of a starch synthase gene is placed 
operably behind a promoter, in the sense orientation, such that said starch synthase 
20 is expressed under the control of said promoter sequence. In a preferred embodiment, 
the starch synthase genetic sequence is a starch synthase genomic sequence, cDNA 
molecule or protein-coding sequence. 

Wherein it is desirable to reduce the level of a particular starch synthase isoenzyme 
25 in a plant cell, the nucleic acid molecule of the present invention can be expressed in 
the antisense orientation, as an antisense molecule or a ribozyme molecule, under the 
control of a suitable promoter. 

Alternatively, the nucleic acid molecule of the present invention may also be expressed 
30 in the sense orientation, in the form of a co-suppression molecule, to reduce the level 
of a particular starch synthase isoenzyme in a plant cell. As will be known to those 
skilled in the art, co-suppression molecules that comprise inverted repeat sequences 
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of a target nucleic acid molecule provide optimum efficiency at reducing expression of 
said target nucleic acid molecule and, as a consequence, the present invention clearly 
contemplates the use of inverted repeat sequences of any one or more of the starch 
synthase genetic sequences exemplified herein, or inverted repeat sequences of a 
5 homologue, analogue or derivative of said starch synthase genetic sequences, to 
reduce the level of a starch synthase isoenzyme in a plant. 

The expression of an antisense, ribozyme or co-suppression molecule comprising a 
starch synthase gene in a cell such as a plant cell, fungal cell, insect cell, animal cell, 

10 yeast cell or bacterial cell, may also increase the availability of carbon as a precursor 
for a secondary metabolite other than starch (e.g. sucrose or cellulose). By targeting 
the endogenous starch synthase gene, expression is diminished, reduced or otherwise 
lowered to a level that results in reduced deposition of starch in the amyloplast or 
chloroplast and/or leads to modified starch granule structure and/or composition 

15 and/or altered amylose/amylopectin ratio. 

Accordingly, a further aspect of the present invention provides a method of modifying 
the starch content and/or starch composition of one or more tissues or organs of a 
plant, comprising expressing therein a sense molecule, antisense molecule, ribozyme 

20 molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1, 3, 5, 7, 
9,11-16, 37, or 38, or a complementary nucleotide sequence thereto for a time and 
under conditions sufficient for the enzyme activity of one or more starch synthase 
isoenzymes to be modified. This aspect of the invention clearly extends to the 

25 introduction of the sense molecule, antisense molecule, ribozyme molecule, co- 
suppression molecule, or gene-targeting molecule to isolated plant cells, tissues or 
organs or organelles by cell fusion or transgenic means and the regeneration of intact 
plants therefrom. 

30 Co-suppression is the reduction in expression of an endogenous gene that occurs 
when one or more copies of said gene, or one or more copies of a substantially similar 
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gene are introduced into the cell, preferably in the form of an inverted repeat structure. 

The present inventors have discovered that the genetic sequences disclosed herein 
are capable of being used to modify the level of starch when expressed, particularly 
5 when expressed in plants cells. Accordingly, the present invention clearly extends to 
the modification of starch biosynthesis in plants, in particular wheat or barley plants or 
a progenitor plant species, or a relative thereto such as the diploid Triticum tauschii 
or other diploid, tetraploid, aneuploid, polyploid, nullisomic, or a wheat/barley addition 
line, amongst others. 

In particular, the present invention contemplates decreased starch production and/or 
modified starch composition in transgenic plants expressing the nucleic acid molecule 
of the invention in the antisense orientation or alteratively, expressing a ribozyme or 
co-suppression molecule comprising the nucleic acid sequence of the invention such 
15 that the activity of one or more starch synthase isoenzymes is decreased therein. 



In the context of the present invention, an antisense molecule is an RNA molecule 
20 which is transcribed from the complementary strand of a nuclear gene to that which is 
normally transcribed to produce a "sense" mRNA molecule capable of being translated 
into a starch synthase polypeptide. The antisense molecule is therefore 
complementary to the mRNA transcribed from a sense starch synthase gene or a part 
thereof. Although not limiting the mode of action of the antisense molecules of the 
25 present invention to any specific mechanism, the antisense RNA molecule possesses 
the capacity to form a double-stranded mRNA by base pairing with the sense mRNA, 
which may prevent translation of the sense mRNA and subsequent synthesis of a 
polypeptide gene product. 

30 Ribozymes are synthetic RNA molecules which comprise a hybridising region 
complementary to two regions, each of at least 5 contiguous nucleotide bases in the 
target sense mRNA. In addition, ribozymes possess highly specific endoribonuclease 
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activity, which autocatalytically cleaves the target sense mRNA. A complete 
description of the function of ribozymes is presented by Haseloff and Gerlach (1988) 
and contained in International Patent Application No. WO89/05852. 

5 The present invention extends to ribozyme which target a sense mRNA encoding a 
native starch synthase gene product, thereby hybridising to said sense mRNA and 
cleaving it, such that it is no longer capable of being translated to synthesise a 
functional polypeptide product. 

10 According to this embodiment, the present invention provides a ribozyme or antisense 
molecule comprising at least 5 contiguous nucleotide bases derived from any one of 
SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 37, or 38, or a complementary nucleotide sequence 
thereto or a homologue, analogue or derivative thereof, wherein said antisense or 
ribozyme molecule is able to form a hydrogen-bonded complex with a sense mRNA 

15 encoding a starch synthase gene product to reduce translation thereof. 

In a preferred embodiment, the antisense or ribozyme molecule comprises at least 10 
to 20 contiguous nucleotides derived from any one of SEQ ID NOS: 1, 3, 5, 7, 9,11-16, 
37, or 38, or a complementary nucleotide sequence thereto or a homologue, analogue 
20 or derivative thereof. Although the preferred antisense and/or ribozyme molecules 
hybridise to at least about 10 to 20 nucleotides of the target molecule, the present 
invention extends to molecules capable of hybridising to at least about 50-100 
nucleotide bases in length, or a molecule capable of hybridising to a full-length or 
substantially full-length mRNA encoded by a starch synthase gene. 

25 

Those skilled in the art will be aware of the necessary conditions, if any, for selecting 
or preparing the antisense or ribozyme molecules of the invention. 

It is understood in the art that certain modifications, including nucleotide substitutions 
30 amongst others, may be made to the antisense and/or ribozyme molecules of the 
present invention, without destroying the efficacy of said molecules in inhibiting the 
expression of a starch synthase gene. It is therefore within the scope of the present 
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invention to include any nucleotide sequence variants, homologues, analogues, or 
fragments of the said gene encoding same, the only requirement being that said 
nucleotide sequence variant, when transcribed, produces an antisense and/or 
ribozyme molecule which is capable of hybridising to a sense mRNA molecule which 
5 encodes a starch synthase gene product. 

Gene targeting is the replacement of an endogenous gene sequence within a cell by 
a related DNA sequence to which it hybridises, thereby altering the form and/or 
function of the endogenous gene and the subsequent phenotype of the cell. According 

10 to this embodiment, at least a part of the DNA sequence defined by any one of SEQ 
ID NOS: 1 , 3, 5, 7, 9,1 1-16, 37, or 38 may be introduced into target cells containing an 
endogenous gene that encodes a particular starch synthase isoenzyme, thereby 
replacing said endogenous gene. According to this embodiment, the polypeptide 
product of the gene targetting molecule generally encodes a starch synthase 

15 isoenzyme that possesses different catalytic activity to the polypeptide product of the 
endogenous gene, producing in turn modified starch content and/or composition in the 
target cell 

The present invention extends to genetic constructs designed to facilitate expression 
20 of a sense molecule, an antisense molecule, ribozyme molecule, co-suppression 
molecule, or gene targeting molecule of the present invention. The requirements for 
expressing such molecules are similar to those for expressing a recombinant 
polypeptide as described supra. 

25 The present invention further extends to the production and use of starches and 
proteins produced using the novel genes described herein. Modified starches 
produced by plants which have been selected using marker-assisted selection, or 
alternatively, produced by transgenic plants carrying the introduced starch synthase 
genes, are particularly suitable for use in food products, such as, for example, flour 

30 and flour-based products, in particular those products selected from the group 
consisting of: flour-based sauce; leavened bread; unleavened bread; pasta, noodle; 
cereal; snack food; cake; and pastry. Modified proteins are also suitable for use in non- 
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food products, such as, for example, those non-food products selected from the group 
consisting of: films; coatings; adhesives; building materials; and packaging materials. 

Additionally, starch hydrolysates or undegraded starches are both useful in industry 
5 and, as a consequence, the present invention is useful in applications relating to the 
use of both starch hydrolysates and undegraded starches. By "starch hydrolysates" is 
meant the glucose and glucan components that are obtainable by the enzymatic or 
chemical degradation of starch in chemical modifications and processes, such as 
fermentation. 

10 

Starch produced by plants expressing the sense, antisense, co-suppression, gene- 
targetting or ribozyme molecules of the present invention may exhibit modified 
viscosities and/or gelling properties of its glues when compared to starch derived from 
wild-type plants. Native starches produced by the performance of the inventive method 

15 are useful as an additive in the following: (i) foodstuffs, for the purpose of increasing 
the viscosity or gelling properties of food; (ii) in non-foodstuffs, such as an adjuvant or 
additive in the paper and cardboard industries, for retention or as a size filler, or as a 
solidifying substance or for dehydration, or film coating, amongst others; (iii) in the 
adhesive industry as pure starch glue, as an additive to synthetic resins and polymer 

20 dispersions, or as an extenders for synthetic adhesives; (iv) in the textile and textile 
care industries to strengthen woven products and reduce burring or to thicken dye 
pastes; (v) in the building industry, such as a binding agent in the production of 
gypsum plaster boards, or for the deceleration of the sizing process; (vi) in ground 
stabilization or for the temporary protection of ground particles against water in artificial 

25 earth shifting; (vii) as a wetting agent in plant protectants and fertilizers; (viii) as a 
binding agent in drugs, pharmaceuticals and medicated foodstuff such as vitamins, etc; 
(ix) as an additive in coal and briquettes; (xi) as a flocculent in the processing of coal 
ore and slurries; (xii) as a binding agent in casting processes to increase flow 
resistance and improve binding strength; and (xiii) to improve the technical and optical 

30 quality of rubber and plastic products. Additional applications are not excluded. 

A further aspect of the present invention provides an isolated promoter that is operable 
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in the endosperm of a monocotyledonous plant cell, tissue or organ, and preferably in 
the endosperm of a monocotyledonous plant cell, tissue or organ. According to this 
embodiment, it is preferred that the promoter is derived from a starch synthase gene 
of the present invention, such as a promoter that is linked in vivo to any one of SEQ 
5 ID NOS: 1, 3, 5, 7, 9,11-16, 37, or 38, or a complementary nucleotide sequence 
thereto. 

In a particularly preferred embodiment, the promoter comprises a nucleotide sequence 
derivable from the 5'-upstream region of SEQ ID NO: 1 1 or SEQ ID NO: 37 or SEQ ID 

10 NO: 38, or a complementary nucleotide sequence thereto, an more preferably 
comprises nucleotides 1 to about 287 of SEQ ID NO: 11, or nucleotides 1 to about 
1416 of SEQ ID NO: 37, or nucleotides 1 to about 973 of SEQ ID NO: 38, or a 
complementary nucleotide sequence thereto. The present invention clearly extends 
to promoter sequences that comprise further nucleotide sequences in the region 

15 upstream of the stated nucleotide sequence that are linked in vivo to said nucleotide 
sequence in the wheat genome. 

In a related embodiment, the promoter sequence of the present invention will further 
comprise an exon sequence derived from a starch synthase gene, such as, for 

20 example, an intron I sequence described herein, or a complementary nucleotide 
sequence thereto. Those skilled in the art will be aware that the inclusion of such 
nucleotide sequences may increase the expression of a heterologous structural gene, 
the expression of which is controlled thereby. Preferred intron I sequences include, 
for example, nucleotide sequences in the region of about position 1744 to about 1847 

25 of SEQ ID NO: 37, and/or about position 1 100 to about position 2056 of SEQ ID NO: 
38. Additional sequences comprising intron/exon junction boundary sequences which 
are readily determined by those skilled in the art are not excluded. 



30 



The present invention further extends to the expression of any structural gene operably 
under the control of the starch synthase promoter sequence exemplified herein or a 
functional homologue, analogue or derivative of said promoter sequence. 
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As with other embodiments described herein for expression in cells, a genetic 
construct may be employed to effect said expression and the present invention clearly 
extends to said genetic constructs. 

5 The polypeptide encoded by the structural gene component may be a reporter 
molecule which is encoded by a gene such as the bacterial (3-glucuronidase gene or 
chloramphenicol acetyltransferase gene or alternatively, the firefly luciferase gene. 
Alternatively, wherein it is desirable to alter carbon partitioning within the endosperm, 
the polypeptide may be an enzyme of the starch sucrose biosynthetic pathways. 
10 Preferably, the promoter sequence is used to regulate the expression of one or more 
of the starch synthase genes of the present invention or a sense, antisense, ribozyme, 
co-suppression or gene-targetting molecule comprising or derived from same. 

Recombinant DNA molecules carrying the aforesaid nucleic acid molecule of the 

15 present invention or a sense, antisense, ribozyme, gene-targetting or co-suppression 
molecule and/or genetic construct comprising same, may be introduced into plant 
tissue, thereby producing a "transgenic plant", by various techniques known to those 
skilled in the art. The technique used for a given plant species or specific type of plant 
tissue depends on the known successful techniques. Means for introducing 

20 recombinant DNA into plant tissue include, but are not limited to, transformation 
(Paszkowski et ai t 1984), electroporation (Fromm et a/., 1985), or microinjection of the 
DNA (Crossway et al., 1986), or T-DNA-mediated transfer from Agrobacterium to the 
plant tissue. Representative T-DNA vector systems are described in the following 
references: An ef a/.(1985); Herrera-Estrella etal. (1983a, b); Herrer?.-Estrella etal. 

25 (1 985). Once introduced into the plant tissue, the expression of the introduced gene 
may be assayed in a transient expression system, or it may be determined after 
selection for stable integration within the plant genome. Techniques are known for the 
in vitro culture of plant tissue, and in a number of cases, for regeneration into whole 
plants. Procedures for transferring the introduced gene from the originally transformed 

30 plant into commercially useful cultivars are known to those skilled in the art. 

In general, plants are regenerated from transformed plant cells or tissues or organs on 
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hormone-containing media and the regenerated plants may take a variety of forms, 
such as chimeras of transformed cells and non-transformed cells; clonal transformants 
(e.g., all cells transformed to contain the expression cassette); grafts of transformed 
and untransformed tissues (e.g.,. a transformed root stock grafted to an untransformed 
5 scion in citrus species). Transformed plants may be propagated by a variety of means, 
such as by clonal propagation or classical breeding techniques. For example, a first 
generation (or T1) transformed plants may be selfed to give homozygous second 
generation (or T2) transformed plants, and the T2 plants further propagated through 
classical breeding techniques. 

10 

Accordingly, a still further aspect of the present invention contemplates a transgenic 
plant comprising an introduced sense molecule, antisense molecule, ribozyme 
molecule, co-suppression molecule, or gene-targeting molecule having at least about 
85% nucleotide sequence identity to any one of any one of SEQ ID NOS: 1 , 3, 5, 7, 
15 9,11-16, 37, or 38, or a complementary nucleotide sequence thereto or a genetic 
construct comprising same. The present invention further extends to those plant parts, 
propagules and progeny of said transgenic plant or derived therefrom, the only 
requirement being that said propagules and progeny also carry the introduced nucleic 
acid molecule(s), 

20 

The present invention is further described by reference to the following non-limiting 
examples. 

EXAMPLE 1 
Plant material 

25 Genetic stocks of hexaploid bread wheat Triticum aestivum cv. Chinese Spring with 
various chromosome additions and deletions were kindly supplied by Dr E. Lagudah 
(CSIRO Plant Industry, Canberra) and derived from stocks described in Sears and 
Miller (1985). The hexaploid (Triticum aestivum) wheats cv Gabo and cv Wyuna were 
grown in controlled growth cabinet conditions (18°C day and 13 C night, with a 

30 photoperiod of 16 h). Wheat leaves and florets prior to anthesis, and endosperm were 
collected over the grain filling period, immediately frozen in liquid nitrogen and stored 
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at -80°C until required. 

EXAMPLE 2 

Gel Electrophoresis, Antibodies and Immunoblotting 

5 Monoclonal antibodies against the Sgp-1 proteins, and their use in the immunoblotting 
of SDS-PAGE gels have been described previously (Rahman ef a/., 1995). 

EXAMPLE 3 
Preparation of total RNA from wheat 

10 Total RNA was isolated from the leaf, floret and endosperm tissues of wheat 
essentially as described by Higgins etal. (1976) or Rahman et al. (1998). RNA was 
quantified by UV absorption and by separation in 1.4% (w/v) agarose-formaldehyde 
gels which were then visualised under UV light after staining with ethidium bromide. 

15 EXAMPLE 4 

Construction and screening of cDNA libraries 

A first cDNA library, an expression cDNA library of wheat endosperm, was constructed 
from mRNA isolated from wheat cv Chinese Spring. RNA from 5, 7, 9, 1 1 and 1 3 days 
after anthesis was pooled and random primers were used for the first strand of cDNA 
20 synthesis. Monoclonal antibodies against 100 -105 kDa proteins in wheat starch 
granules (Rahman et al., 1995) were used for immunoscreening of the expression 
cDNA library. 

A second cDNA library was constructed from the endosperm mRNA of the hexaploid 
25 Triticum aestivum cultivar Wyuna, 8-12 days after anthesis, as described by Rahman 
ef al. (1997). This library was screened with a 85-bp cDNA fragment, wSSIIpl , which 
was obtained by immunoscreening of the expression cDNA library as described above. 
The wSSIIpl probe corresponded to nucleotide positions 988 to 1072 of wSSIIB (SEQ 
ID NO:1) at the hybridisation conditions as described earlier (Rahman ef al., 1998). 

30 

A third cDNA library was constructed from RNA from the endosperm of the hexaploid 
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Triticum aestivum cultivar Rosella as described by Rahman et al. (1997). This library 
was screened with a 347-bp cDNA fragment, wSSIIIpl for the first screening, and a 
478-bp cDNA fragment wSSIIIp3 for the second screening using the hybridisation 
conditions described herein. 

5 

EXAMPLE 5 

Construction and screening of Triticum tauschii genomic library 

The genomic library used in this study, prepared from Triticum tauschii, var 
strangulata, (Accession Number CP1 110799), has been described in Rahman et al., 
10 (1997). Of all the accessions of T. tauschii surveyed, DNA marker analysis suggests 
that the genome of CPI 110799 is the most closely related to the D genome of 
hexaploid wheat (Lagudah et al., 1991). 

Hybridisations were carried out in 25% formamide, 6 x SSC, 0.1 % SDS at 42°C for 16 
15 hours, then filters were washed 3 times using 2 x SSC containing 0.1 % SDS at 65°C 
for 1 hour per wash. 

For the isolation of a genomic wSSII clone, the probe comprised the PCR-derived DNA 
fragment wSSIIp2 and positive-hybridising plaques were digested using the restriction 
20 enzyme SamHI, separated on a 1% agarose gel, transferred to nitrocellulose 
membrane and hybridised to probe wSSIIp4 comprising nucleotides 1 to 367 of the 
wSSIIA cDNA clone, using the conditions described by Rahman etal. (1997). 

For the isolation of a genomic wSSIII clone, plaques hybridising to the PCR-derived 
25 DNA fragment wSSIIIpl from clone wSSIII.B3 (i.e. nucleotides 3620 to 3966 of SEQ 
ID NO:7) were selected and re-screened until plaque-purified. 

EXAMPLE 6 
DNA sequencing and analysis 

30 DNA sequencing was performed using the automated ABI system with dye terminators 
as described by the manufacturers. DNA sequences were analysed using the GCG 
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suite of programs (Devereaux et al., 1 984). 

EXAMPLE 7 
DNA and RNA analysis 

5 DNA was isolated and analysed as previously described (Maniatis et al., 1982; 
Rahman et a/., 1998). Approximately 20 ^9 of DNA was digested with restriction 
enzymes BamHI. Dra\ and EcoRI, separated on a 1% agarose gel and transferred to 
reinforced nitrocellulose membranes (BioRad) and hybridised with 32 P-labelled DNA 
probe, either wSSIIIpl , corresponding to nucleotides 3620 to 3966 of the wheat SSIII 
10 gene, or alternatively, with the entire wSSII cDNA clone. DNA fragment probes were 
labelled with the Rapid Multiprime DNA Probe Labelling Kit (Promega). 

The hybridisation and wash conditions were performed as described in Rahman et al. 
(1997). For RNA analysis, 10 /xg of total RNA was separated in a 1.4% agarose- 

15 formaldehyde gel and transferred to a Hybond N+ membrane (Amersham), and 
hybridised with cDNA probe at 42°C as previously described by Khandjian et al., 
(1 987) or Rahman et al., (1998). After washing for 30 minutes at 65°C with 2x SSC, 
0.1% SDS; followed by three washes of 40 minutes at 65°C with 0.2x SSC, 1% SDS, 
the membranes were visualised by overnight exposure at -80°C with Kodak MR X-ray 

20 film. 

EXAMPLE 8 

Expression of wheat Sgp-1 polypeptides in the wheat endosperm 

The development and use of monoclonal antibodies to the Sgp-1 proteins has been 
25 described previously (Rahman et al., 1995). These antibodies were used by the 
present inventors to characterise the expression and localisation of the Sgp-1 proteins. 

The proteins found in the matrix of the wheat starch granule are shown in Figure 1, 
lane 1 . The remaining lanes show an immunoblot of proteins from the soluble phase 
30 (Figure 1; lanes 2-4) and the starch granule (Figure 1; lanes 5-7), respectively, 
following SDS-PAGE. In addition to cross-reactivity with the 100-105 kDa proteins, a 
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weak cross-reaction with a 50 kDa protein in both the granule and the soluble fractions 
were observed (Figure 1). The Sgp-1 polypeptides are present in the starch granule 
throughout endosperm development (Figure 1; lanes 5-7, also see Rahman et ai t 
1995). However, as the endosperms matures, there is a reduction in the amount of 
5 Sgp-1 protein found in the soluble fraction. Lane 4 shows that by 25 days after 
anthesis, the level of these proteins in the soluble fraction is substantially reduced. 
This observation is consistent with previous results from Rahman et a/., (1995), who 
suggested that the Sgp-1 proteins were exclusively granule bound based on studies 
of granules from endosperm in mid-late stages endosperm development, however, 
10 these results suggest that the partitioning of these proteins between the granule and 
the soluble phase changes during development. 

EXAMPLE 9 

Isolation of cDNA clones encoding wheat starch synthase II (wSSII) proteins 

15 Monoclonal antibodies against Sgp-1 polypeptides (Rahman et a/., 1995) were used 
to probe the expression library described in Example 4 (i.e. the first cDNA library). 
Three immunoreactive plaques were identified and sequenced. One clone, designated 
wSSIIpl , contained an 85-bp cDNA insert with homology to maize SSIIa (Ham ef a/., 
1998). 

20 

DNA from the wSSIIpl clone was.used as a probe in the hybridisation screening of the 
second cDNA library, prepared from Triticum aestivum cultivar Wyuna endosperm RNA 
as described in Example 4. Ten hybridising cDNA clones were selected and 
sequenced. On the basis of the DNA sequences obtained, the 10 cDNA clones can be 
25 classified into three groups. Group 1 contains 7 cDNA clones, group 2 contains 2 
cDNA clones and group 3 contains 1 cDNA clone. 

The longest clone from group 1 (designated wSSIIB) is 2939 bp in length (SEQ ID 
NO:1) and encodes a 798 -amino acid polypeptide in the region from nucleotide 
30 position 176 to nucleotide position 2569 (SEQ ID NO:2). 
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The longest clone from group 2 (designated wSSIIA) is 2842 bp in length (SEQ ID 
NO:3) and encodes a 799 -amino acid polypeptide in the region from nucleotide 
position 89 to nucleotide position 2485 (SEQ ID NO:4). 

5 The cDNA from group 3 is a partial cDNA clone (designated wSSIID), which is 2107 
bp in length (SEQ ID NO:5) and encodes a 597 -amino acid polypeptide in the region 
from nucleotide position 1 to nucleotide position 1791 (SEQ ID NO:6). The encoded 
polypeptide is approximately a 200 amino acid residues shorter than that of 
polypeptides encoded by longest clones of group 1 or 2 clones, respectively (Figure 
10 2). 

Comparison of the three cDNA clones, wSSIIB, wSSIIA and wSSIID shows that they 
share 95.7% to 96.6% identity at the amino acid level, with variation at 44 amino acid 
positions between the three sequences (Figure 3). Of the 44 amino acid changes 

15 between these sequences, 31 changes occur in the N-terminal region (residues 1 to 
300), 10 changes occur in the central region (residues 301 to 729) and 3 changes 
occur in the C-terminal region (residues 730 to 799). The wSSIIA polypeptide (799 
amino acid residues) and wSSIIB polypeptide (798 amino acid residues) sequences 
differ in length by a single amino acid residue, due to the deletion of Asp-69 from the 

20 wSSIIB polypeptide sequence. 

A comparison of the nucleotide sequences of the wSSIA, wSSIIB and wSSIID cDNA 
clones with the nucleotide sequence of the wSSIIpl cDNA obtained by 
immunoscreening confirms that the wSSIIpl sequence is found in each cDNA (Figure 

25 3). The peptide encoded by the wSSIIpl cDNA clone corresponds to amino acid 
residues in the region from residue 272 to residue 298 of the wSSIIA polypeptide, and 
to amino acid residues in the region from residue 271 to residue 297 of the wSSIIB 
polypeptide (see Figure 3). Thus, the peptide epitope encoded by wSSIIpl that reacts 
with the anti-Sgp-1 monoclonal antibodies can therefore be localised to this region of 

30 the wSSIIA and wSSIIB polypeptides and to the corresponding region of the wSSIID 
polypeptide. 
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Notwithstanding that a region having about 63% amino acid sequence identity to the 
peptide epitope encoded by clone wSSIIpl is found in the maize SSIIa polypeptide 
(Figure 3), the degree of amino acid conservation between maize and wheat 
sequences in this region of the polypeptide is insufficient for immunological cross- 
5 reactivity to occur between these species using the monoclonal antibodies to the 
wheat Sgp-1 proteins described by Rahman et al. (1995). Additionally, this peptide 
epitope is not found in granule-bound starch synthases, SSI, or SSIII (data not shown). 

The wSSIIB cDNA (SEQ ID NO:1) encodes an amino acid sequence comprising the 
10 peptide motif AAGKKDAG ID (SEQ ID NO: 18) between residues 60 and 69 of SEQ ID 
NO:2 (Figure 3) which, with the exception of the second residue, is identical to the N- 
terminal of the 100 kDa (A T / L GKKDAGID: SEQ ID NOS:19 and 20) protein (Sgp-B1) 
from the wheat starch granule (note that the sequence given in Rahman et a/., 1995 
(A T / L GKKDAL: SEQ ID NOS: 21 and 22 ) has been revised following further amino acid 
15 sequence analysis). 

The wSSIIA cDNA clone (SEQ ID NO:3) encodes an amino acid sequence comprising 
the peptide motif AAGKKDARVDDDAA (SEQ ID NO: 23) at residues 60 to 73 of SEQ 
ID NO:4, which is about 66% identical to the N-terminal amino acid sequence (i.e. 
20 ALGKKDAGIVDGA: SEQ ID NO: 24) of the 104 kDa and 105 kDa starch granule 
proteins, Sgp-D1 and Sgp-A1 respectively, as determined by sequence analysis of 
isolated protein (Rahman et a/., 1995). 

Furthermore, Takaoka et al. (1997) reported the amino acid sequences of 3 
25 polypeptides obtained from sequencing starch granule proteins derived from the Sgp-1 
proteins. Peptide 3 described by Takaoka et al. (1997) corresponds to amino acid 
residues 378 to 387 of the amino acid sequence of the wSSIIA cDNA (SEQ ID NO:4; 
Figure 3). Peptides 1 and 2 described by Takaoka ef al. (1997) could not be detected 
in the amino acid sequences of the wSSII cDNA clones of the present invention, 
30 however peptide 1 of Takaoka et al. (1997) can be found in the amino acid sequences 
of SSI from maize, rice, wheat and potato (data not shown). 
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Denyer et al. (1995) demonstrated that the Sgp-1 proteins possess starch synthase 
activity and, as a consequence, the wSSIIB, wSSIA and wSSIID cDNA clones encode 
starch synthase enzymes that are differentially expressed in a developmentally- 
regulated manner in both the soluble and granule-bound fractions of the endosperm 
5 (Figure 1). Based on the nomenclature suggested by Ham et al. (1998), it is 
appropriate to describe the Sgp-1 proteins as "starch synthases" rather than "granule- 
bound starch synthases". 

EXAMPLE 10 

10 Analysis of wheat starch synthase II mRNA expression 

The mRNA for wheat starch synthase II could be detected in leaves, pre-anthesis 
florets and endosperm of wheat when total RNAs isolated from these tissue were 
probed with a PCR probe, wSSIIp2, corresponding to nucleotide positions 1435 to 
1835 bp of wSSIIB-cDNA (SEQ ID NO:1; Figure 4). Unlike wSSI, which could not be 

15 detected in wheat leaves derived from plants grown under the same conditions, wSSII 
genes are highly-expressed in the leaves (Figure 4, lane 1), and expressed at an 
intermediate level in pre-anthesis florets (Figure 4, lane 2), and at much lower levels 
in developing wheat endosperm cells (Figure 4, lanes 3-11). In contrast, the maize 
SSIIa is expressed predominantly in the endosperm, whilst the maize SSIIb is detected 

20 mainly in the leaf, albeit at low levels (Ham et al., 1 998). 

The wSSII mRNA was detectable in the endosperm 6 days after anthesis and mRNA 
levels increase between 8 and 18 days post-anthesis, after which time levels of mRNA 
decline. 

25 

Southern blotting experiments in wheat demonstrated that the wSSIIp2 probe used 
detected only a single copy of the SSII gene in each genome (data not shown). Thus, 
it is unlikely that this probe cross-hybridised with mRNAs encoded by genes other than 
wSSII. 

30 
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EXAMPLE 11 

Chromosomal localization of the wheat wSSII genes. 

I. Amplification of specific cDNA regions of whe at starch synthase II using PCR 
Two PCR products, wSSIIp2 and wSSIIp3 were amplified from the cDNA clone wSSIIB 

5 and used for the northern hybridisation and Southern hybridisation, respectively. 

The primers sslla (5' TGTTGAGGTTCCATGGCACGTTC 3': SEQ ID NO: 25) and ssllb 
(5* AGTCGTTCTGCCGTATGATGTCG 3': SEQ ID NO: 26) were used to amplify the 
cDNA fragment wSSIIp2 (i.e. nucleotide positions 1435 to 1835 of SEQ ID NO:1). 

10 

The primers ssllc (5* CCAAGTACCAGTGGTGAACGC 3': SEQ ID NO: 27) and sslld 
(5* CGGTGGGATCCAACGGCCC 3': SEQ ID NO: 28) were used to amplify the cDNA 
fragment wSSIIp3 (i.e. nucleotide positions 2556 to 2921 of SEQ ID NO:1 ). 

15 The amplification reactions were performed using a FTS-1 thermal sequencer (Corbett, 
Australia) for 1 cycle of 95°C for 2 minutes; 35 cycles of 95°C for 30 seconds, 60°C for 
1 minutes, 72°C for 2 minutes and 1 cycle of 25°C for 1 minute. 

II. PCR and nucleotide sequence analysis of 3' seouences of wheat SSII genes 

20 Genomic DNA was extracted from wild-type Chinese Spring wheat, and from three 
nullisomic-tetrasomic lines of chromosome 7 of Chinese Spring wheat, and from 
Triticum tauschii (var strangulata, accession number CPI 100799), and used as a 
template for the amplification and nucleotide sequence analysis of wheat SSII genes. 

25 RFLP analysis of SamHI and EcoRI restricted DNA from each wheat or T. Tauschii line 
was carried out using the wSSIIp3 fragment as a probe. Three hybridising bands were 
obtained which could be assigned to chromosomes 7A, 7B and 7D, respectively (data 
not shown). This analysis indicates that there is a single copy of the wSSII gene in 
each genome in hexaploid wheat, consistent with the findings of Yamamori and Endo 

30 (1996) who located the SGP-A1, B1 and D1 proteins to the short arm of chromosome 
7. 
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PCR analysis was used to assign each of the cDNA clones to the individual wheat 
genomes. A single 365 bp PCR fragment was obtained from nullisomic-tetrasomic 
genomic DNA of Chinese Spring when primers ssllc and sslld were used for the PCR 
amplification (Figure 5, right panel). This PCR product is obtained only from lines 
5 bearing the B genome. The fragment was cloned and sequenced and shown to be 
identical to a 365 bp region of the wSSIIB cDNA. An identical fragment is obtained by 
PCR amplification of the wSSIIB cDNA clone, but not by amplification of the wSSIIA 
or wSSIID clones, supporting the conclusion that the wSSIIB cDNA is the product of 
a gene located on chromosome 7 of the B genome of hexaploid wheat. 

10 

Two PCR products were also amplified from nullisomic-tetrasomic genomic DNA of 
Chinese Spring using the primers ssllc and sslle (Figure 5, left panel). One PCR 
fragment, approximately 350 bp is only amplified when the A genome is present, and 
a second 322 bp product is only amplified when the D-genome is present. The 350 and 
15 322 bp PCR products were also cloned and sequenced and shown to be identical to 
the wSSIIA and wSSIID cDNAs, respectively, supporting the conclusion that the 
wSSIIA and wSSIID cDNAs are the products of genes located on chromosomes 7A 
and 7D, respectively. 

20 EXAMPLE 12 

Isolation of genomic wSSII clones 

Screening of a genomic library from the D-genome donor of wheat, T. tauschii, was 
performed as described in Example 5, using the PCR-derived DNA fragment wSSIIp2 
as a hybridisation probe. A positive-hybridising clone, designated wSSII-8, and 
25 comprising a putative T. tauschii homologue of the wSSII gene, was isolated. 

Positive-hybridising plaques were digested using the restriction enzyme BamH\, 
separated on a 1% agarose gel, transferred to nitrocellulose membrane and hybridised 
to probe wSSIIp4 comprising nucleotides 1 to 367 of the wSSIIA cDNA clone, using 
30 the conditions described by Rahman et a/. (1997). Clone wSSII-8 also hybridises 
strongly to the wSSIIp4 probe, confirming its identity as a genomic wSSII gene. 
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The complete nucleotide sequence of the wSSII gene was determined and is 
presented herein as SEQ ID NO: 37. The structural features of this gene are present 
in Table 3. A schematic representation of the intron/exon organisation of this gene is 
also presented in Figure 6. 

5 

TABLE 3 



Structural features of the wheat starch synthase II genomic gene 



Mi iflorktirfo Prtcifinn 
ivUui tzuiivit? rudiuuii 

in SEQ ID NO: 37 




Lenath fbases) 


1- 1416 


S'-untranscribed region and 
promoter sequence 


1416 


1417-1743 


exon 1 


327 


1480-1482 


translation start codon (ATG) 


3 


1744-1847 


intron 1 


104 


1848-2553 


exon 2 


706 


2554- 2641 


intron 2 


88 


2642 - 2706 


exon 3 


65 


2707 - 3606 


intron 3 


900 


3607-3684 


exon 4 


78 


3685 - 3773 


intron 4 


89 


3774 - 3884 


exon 5 


111 


3885-3981 


intron 5 


97 


3982 - 4026 


exon 6 


45 


4027-4406 


intron 6 


380 


4407 - 4580 


exon 7 


174 i 


4581 - 7296 


intron 7 


2716 


7297 - 8547 


exon 8 


1251 


8251 - 8253 


translation stop codon (TGA) 


3 


8548 -9024 


S'-untranscribed region 


477 
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EXAMPLE 13 

Cloning of specific cDNA regions of wheat starch synthase III using RT-PCR 

PCR primers were used to amplify sequences of starch synthase III from wheat 
endosperm cDNA. The design of PCR primers was based on the sequences of starch 
5 synthase III from potato and the du1 starch synthase III gene of maize. 

First-strand cDNAs were synthesised from 1 ^g of total RNA (derived from endosperm 
of the cultivar Rosella, 12 days after anthesis) as described by Maniatis et al. (1982), 
and then used as templates to amplify two specific cDNA regions, wSSIIIpl and 
10 wSSIIIp2, of wheat starch synthase III by PCR. 

The primers used to obtain the cDNA clone wSSIIIpl were as follows: 
Primer wSS3pa (5' GGAGGTCTTGGTGATGTTGT 3': SEQ ID NO: 29); and 
Primer wSS3pb (5' CTTGACCAATCATGGCAATG 3': SEQ ID NO: 30). 

15 

The primers used to obtain the cDNA clone wSSIIIp2 were as follows: 
Primer wSS3pc (5' CATTGCCATGATTGGTCAAG 3': SEQ ID NO: 31); and 
Primer wSS3pd (5* ACCACCTGTCCGTTCCGTTGC 3': SEQ ID NO: 32). 

20 The amplified clones wSSIIIpl and wSSIIIp2 were used as probes to screen the third 
cDNA library and T. tauschii genomic DNA library as described in Example 4. 

A further probe designated wSSIIIp3 was used for screening the third cDNA library, as 
described in Example 4. Probe wSSIIIp3 was amplified by PCR from a cDNA clone 
25 produced from the first screening using the following amplification primers: 

Primer wSS3pe (5' GCACGGTCTATGAGAACAATGGC 3': SEQ ID NO: 33); and 
Primer wSS3pf (5' TCTGCATACCACCAATCGCCG 3": SEQ ID NO: 34). 

The amplification reactions were performed using a FTS-1 or FTS4000 thermal 
30 sequencer (Corbett. Australia) for 1 cycle of 95°C for 2 minutes; 35 cycles of 95°C for 
30 seconds, 60°C for 1 minutes, 72°C for 2 minutes and 1 cycle of 25°C for 1 minute. 
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Amplified sequences of the expected length were obtained, cloned and sequenced, 
and shown to contain DNA sequences highly homologous to the maize and potato 
SSIII genes. PCR fragments were subsequently used to probe a wheat cDNA library 
5 by DNA hybridisation and 8 positive clones were obtained, including one 3 kb cDNA. 
A region from the 5' end of this cDNA was amplified by PCR and used a probe for a 
second round of screening the cDNA library, obtaining 8 cDNA clones. Of these, one 
cDNA was demonstrated to be full length (wSSIII.B3, 5.36 kb insert). The sequence 
of the 5,346 bp wSSIII.B3 cDNA clone is given in SEQ ID NO:7. 

10 

Sequencing of the 8 cDNA clones obtained from the second round screening of the 
wheat cDNA library revealed that there were at least 2 classes of cDNA encoding SSIII 
present, possibly being encoded by homeologous genes on different wheat genomes. 
The sequence of a representative of this second class of cDNA clones, wSSIII.BI , is 
15 shown in SEQ ID NO:9. The 3261 bp clone wSSIII.BI is not full length, however it is 
similar to nucleotides 1739 to 5346 of the homeologous done wSSIII.B3 (SEQ ID NO: 
7). Clone wSSIII.BI has an open reading frame between nucleotide positions 1 and 
3177. 

20 An open reading frame is found in the cDNA clone wSSIII.B3 (SEQ ID NO:7), in the 
region between position 29, commencing the ATG start codon, and nucleotide position 
4912. The amino acid sequence deduced from this open reading frame is shown in 
SEQIDNO:8. 

25 An alignment of the deduced amino acid sequences of SSIII from maize, potato and 
wheat is shown in Figure 7. There is about 56.6% identity between the maize SSIII and 
wheat wSSIII.B3 sequence at the amino acid level. 

The C-terminal domain of starch synthases comprise the catalytic domain, and a 
30 characteristic amino acid sequence motif KVGGLGDWTSLSRAVQDLGHNVEV (SEQ 
ID NO: 35) in maize, or alternatively KVGGLGDWTSLSRAIQDLGHTVEV (SEQ ID 
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NO: 36) in wheat, marking the first conserved region in the C-terminal domain. This 
amino acid sequence is present at amino acid residues 1 194 to 1218 of SEQ ID NO: 
8. 

5 The amino acid identity between maize dulH and WSSIII.B3 in the N-terminal region 
(i.e. amino acids 1 to 600 in Figure 7) is only 32.2%; whilst the amino acid identity in 
the central region (i.e. amino acids 601 to 1248 in Figure 7) is 68.4%; and in the C- 
terminal region (i.e. amino acids 1249 to 1631 in Figure 7) is 84.6%. Accordingly, the 
SSIII starch synthases are much more highly conserved between maize and wheat in 

10 the region comprising the catalytic domain of the proteins. 

EXAMPLE 14 

Analysis of wheat starch synthase III mRNA expression 

Figure 8 shows the expression of wSSIII mRNA during endosperm development in two 
15 wheat varieties grown under defined environmental conditions. The expression of the 
gene is seen very early in endosperm development in both cUltivars, 4 days after 
anthesis (Figure 8, panels a and b). Expression in the leaf of the variety Gabo is very 
weak (Figure 8, panel c, Lane L) whereas strong expression is seen in pre-anthesis 
florets (Figure 8, panel c, Lane P). 

20 

EXAMPLE 15 
Amino acid sequence comparisons between 
wheat SSII and SSIII polypeptides 

Amino acid sequence comparisons between wheat BSSS, SSI, SSII and SSIII 
25 polypeptides reveals eight highly-conserved domains (Figure 9). The amino acid 
sequences of these domains are represented in the wheat SSIII amino acid sequence 
by the following sequence motifs: 

(a) Region 1: KVGGLGDWTS; 

(b) Region 2: GHTVEVILPKY; 

30 (c) Region 3: HDWSSAPVAWLYKEHY; 

(d) Region 4: GILNGIDPDIWDPYTD; 
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(e) Region 5: DVPI VG I ITRLTAQKG ; 

(f) Region 5a: NGQVVLLGSA; 

(g) Region 6: AGSDFIIVPSIFEPCGLTQLVAMRYGS; and 

(h) Region 7: TGGLVDTV. 

5 

These conserved amino acid sequences are summarised in Table 4. As shown in 
Table 4 below, there is at least about 25% amino acid sequence identity, preferably 
at least about 30% amino acid sequence identity, more preferably at least about 35% 
amino acid sequence identity, more preferably at least about 40% amino acid 

10 sequence identity, more preferably at least about 45% amino acid sequence identity, 
more preferably at least about 50% amino acid sequence identity, more preferably at 
least about 55% amino acid sequence identity, more preferably at least about 60% 
amino acid sequence identity, more preferably at least about 65% amino acid 
sequence identity, more preferably at least about 70% amino acid sequence identity, 

15 more preferably at least about 75% amino acid sequence identity, more preferably at 
least about 80% amino acid sequence identity, more preferably at least about 85% 
amino acid sequence identity, more preferably at least about 90% amino acid 
sequence identity and even more preferably at least about 95% amino acid sequence 
identity between the amino acid sequences of plant starch synthase enzymes, in 

20 particular wheat starch synthases. 

From the data presented in Table 4, the most conserved regions of the wheat SSII 
and SSI 1 1 polypeptides are a region of 6 or 7 identical amino acids in Region 1 and a 
region of 8 or 9 identical amino acids in Region 6. The lowest regions of identity are 
25 found in regions 3 and 5a. 

For each of the amino acid sequences presented in the first column of Table 4, which 
are specific for wSSIII polypeptides, corresponding signature motifs which are specific 
for wSSII-A, wSSII-B, and wSSII-D polypeptides can be derived from the alignment, 
30 as follows: 

Region 1: KTGGLGDVAGA; 
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Region 2: 


GHRVMWVPRY; 


Region 3: 


NDWHTALLPVYLKAYY; 


Region 4: 


GIVNGIDNMEWNPEVD; 


Region 5: 


DVPLLGFIGRLDGQKG; 


Region 5a: 


DVQLVMLGTG; 


Region 6: 


AGADALLMPS RF(EA/)PCG LNQL YAM AYGT; and 


Region 7: 


VGG(V/L)RDTV. 



Comparison of the amino acid sequences of all available starch synthases with the 
10 deduced amino acid sequences of the three wSSII cDNA clones of the present 
invention (i.e. wSSIIB, wSSIIA and wSSIID) was conducted using PILEUP analysis 
(Devereaux et ai, 1984) and data are presented herein as a dendrogram (Figure 10). 
The sequence of the glycogen synthase of £ coli was also included. Based upon their 
amino acid similarities, four classes of plant starch synthases can be defined: GBSS, 
15 SSI, SSII and SSIII. 

Table 5 shows that levels of identity at the amino acid level between the wSSII 
sequences, as determined using the BESTFIT programme in GCG (Devereaux et ai, 
1984), and other class II starch synthases range from 70% identity with potato SSII to 
20 85% identity with maize SSIIa. Both wSSIIB and wSSIID showed significantly higher 
homology to maize SSIIa than wSSI(A. Based upon sequence identities and the 
function of the Sgp-1 proteins in wheat, the wSSIIB, wSSIIA and wSSID cDNA clones 
are members of the starch synthase II (SSII) group and are more similar in sequence 
to maize SSIIa than maize SSIIb. 

25 
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TABLE 4 



Identities between conserved motifs of plant starch synthases 



Sequence in wSSIII 
polypeptide 


Number of conserved 
residues between wheat 
starch synthases 


Number of conserved 
residues between 
wheat SSII and SSIII 
polypeptides 


Region 1: 
KVGGLGDWTS 


6/1 1 residues 


6/11 residues 


Region 2: 
GHTVEVILPKY 


6/11 residues 


6/11 residues 


Region 3: 

HDWSSAPVAWLYKEHY 


4/16 residues 


5/16 residues 


Region 4: 

GILNGIDPDIWDPYTD 


7/16 residues 


8/16 residues 


Region 5: 

DVPIVGIITRLTAQKG 


8/16 residues 


10/16 residues 


Region 5a: 
NGQWLLGSA 


4/10 residues 


4/10 residues 


Region 6: 

AGSDFIIVPSIFEPCGLT 
QLVAMRYGS 


15/27 residues 


17/27 residues 


Region 7: 
TGGLVDTV 


5/9 residues 


5/9 residues 
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TABLE 5 





wSSII-A 


wSSII-B 


wSSII-D 


wSSI-A 


100% 






wSSII-B 


95.9% 


100% 




wSSII-D 


96.3% 


96.7% 


100% 


maize SSIIa 


76.1% 


85.2% 


84.7% 


maize SSIIb 


76.3% 


76.7% 


75.9% 


pea SSII 


72.0% 


72.2% 


71.8% 


potato SSII 


70.9% 


71.1% 


70.3% 



10 



Figure 1 1 shows a schematic representation of an alignment of plant starch synthase 
sequences, including wheat GBSS, wheat SSI, wheat SSII-A1, maize SSIIa, and 
maize dull-1 polypeptides, in which the position of the first homologous region, 
comprising the consensus motif KXGG, is used as the basis of the alignment. The 

15 major differences in structure between the classes of genes are found in the length of 
the N-terminal region between the transit peptide and the first conserved region. At 
one extreme, the GBSS genes have a very short N-terminal arm, whereas the du1 
starch synthase contains a very long N-terminal extension containing several distinct 
regions. The wSSII genes contain an N-terminal extension which is longer than either 

20 GBSS, SSI, or SSIIb, and slightly longer than the maize SSIIa gene. 

EXAMPLE 16 
Isolation of genomic clones for SSIII 

Screening of a genomic library from the D-genome donor of wheat, T. tauschii, 
25 identified a number of clones which hybridised to the wSSIII PCR fragment. Positive 
plaques in the genomic library were selected as those hybridising with a probe that had 
been generated by PCR (amplifying between nucleotide positions 3620 to 3966) from 
the SSIII cDNA as template. The primer sequences used were as follows: 
wSS3pa (5" GGAGGTCTTGGTGATGTTGT 3': SEQ ID NO: 29); and 
30 wSS3pb (5' CTTGACCAATCATGGCAATG 3* : SEQ ID NO: 30). 
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Hybridisation was carried out in 25% formamide, 6 x SSC, 0.1% SDS at 42 °C for 16 
hour, then washed three times with 2 x SSC containing 0.1% SDS at 65 °C, for 1 hour 
per wash, shows an example of a plaque lift showing positive and negative 
hybridisations for plaques containing the T. tauschii homologue of the wSSIII.B3 gene. 

5 

DNA was isolated from positive-hybridising A clones using methods described by 
Maniatis et al. Briefly, DNA was digested using BamH\ or Bg/I and sub-cloned in to the 
vector pJKKmfm. DNA sequencing was performed using the automated ABI system 
with dye terminators as described by the manufacturers. DNA sequences were 
10 analysed using the GCG suite of programs (Devereaux et ai, 1984). 

Nucleotide sequences of the genomic SSIII clone from 7. tauschii are provided herein 
as 6 contiguous sequences designated fragments 1 to 6 (SEQ ID NOs: 11 to 16, 
respectively). Table 6 defines the relative positions of these fragments with respect to 
15 the SSIII cDNA and describes the positions of exons. Figure 1 1 shows this information 
schematically. 

The complete nucleotide sequence of a wheat SSIII genomic gene is presented herein 
as SEQ ID NO: 38. The structural features of this gene are presented in Table 7, A 
20 schematic representation of the intron/exon organisation of this gene is also presented 
in Figure 12. 

EXAMPLE 17 
Discussion 

25 Eariy work on the Sgp-1 starch synthase proteins (Denyer et a/., 1995; Rahman ef a/., 
1995) was based on the localisation of these proteins in the wheat starch granule, and 
no definitive conclusion concerning their presence or absence in soluble extracts of the 
wheat endosperm was presented. 

30 We have now demonstrated that a monoclonal antibody against the Sgp-1 proteins 
cross reacts strongly with those starch synthase proteins having apparent molecular 
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weights of 100-105 kDa in soluble extracts, however, the appearance of these proteins 
in soluble extracts is dependant on the developmental stage of the endosperm 
material. Whilst the proteins can be detected in the soluble phase in early to mid 
endosperm development, little or no soluble protein remains in late endosperm 
5 development (Figure 1). This observation accounts for the failure of Rahman et al. 
(1995) to detect the protein in soluble extracts in a previous report. 

Based upon the localisation of the Sgp-1 starch synthase proteins in the wheat 
endosperm, the following nomenclature is suggested for wheat starch synthase 
10 enzymes: wGBSS for the 60 kDa granule bound starch synthase (Wx); wSSI for the 
75 kDa starch synthase I (Sgp-3); wSSII for the 100-105 kDa proteins (Sgp-1 ); and 
wSSIII for a soluble high molecular starch synthase. 

The present invention provides cDNA and genomic clones encoding the wSSII and 

15 wSSIII polypeptides and the corresponding genomic clones. Whilst the evidence is 
compelling that the wSSIIA, wSSIIB and wSSIID cDNAs encode the Sgp-A1 , Sgp-B1 
and Sgp-D1 proteins of the wheat starch granule, molecular weights calculated from 
the deduced amino acid sequences of the clones are considerably lower than 
estimates obtained from SDS-PAGE. The molecular weight of the precursor wSSIIA 

20 protein is 87,229 Da, and the mature protein 81 ,164 Da, yet the estimated molecular 
weight in our experience is 105 kDa. The assignment of the wSSIIA cDNA to the A- 
genome of wheat is demonstrated in Figure 5, and the assignment of the 105 kDa 
protein to the A-genome in Denyer et al. (1995) and Yamamori and Endo (1996). 
Similarly, the molecular weight of the wSSliB protein is 86,790 Da and the mature 

25 protein 80,759 Da, yet the molecular weight of the Sgp-B1 protein is estimated to be 
100 kDa. No comparison can be made of the wSSIID sequences as a full length cDNA 
clone was not obtained. The wSSIIA and wSSIIB amino acid sequences differ by just 
a single amino acid residue, yet there is an apparent difference of 5 kDa in molecular 
weight when estimated by SDS-PAGE. Several possibilities can be advanced to 

30 account for this apparent discrepancy in molecular weights. Firstly, the wSSII proteins 
may not migrate in SDS-PAGE in accordance with their molecular weight because they 
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retain some conformation under the denaturing conditions used. Secondly, the proteins 
may be glycosylated. It is also possible that the proteins may be non-covalently linked 
to starch through a high affinity starch binding site which survives denaturation and 
SDS-PAGE. Differences between the apparent molecular weights and those calculated 
5 from the deduced amino acid sequences will have to be defined in establishing the 
relationship between the wSSII proteins and proteins encoded by the analogous SSII 
genes of other species. 

The catalytic domain of the starch synthases is found at the C-terminal end of the 
10 protein (Gao era/.. 1998; Ham era/., 1998). Ham etal. (1998) identified 7 conserved 
regions among SSIIa, SSIIb, SSI and GBSS sequences. We have identified an 
additional conserved region (designated region 5a in Table 4 and Figure 10) 
comprising the amino acid sequence motif DVQLVMLGTG, by a comparison of the 
wSSII and wSSIII sequences of the present invention with differing isoforms of other 
15 plant starch synthases (GBSS, SS1 , SSII and SSIII). The conservation of eight peptide 
regions among the 4 classes of starch synthases is striking, in terms of their sequence 
homologies and their alignment. 

Analysis of the wheat SSII genes shows that there is a motif, PVNGENK, which is 
20 repeated. The area surrounding the repeated PVNGENK motif is not homologous to 
maize SSIIa and the insertion of this region is responsible for the difference in length 
between the wheat SSII and maize SSIIa genes. In pea and potato SSII polypeptides, 
a PPP motif (Figure 3; residues 251-253 and 287-289 respectively) has been 
suggested to mark the end of the N-terminal region and to facilitate the flexibility of an 
25 "N-terminal arm". This motif is not found in either the maize or wheat SSII sequences. 

The generation of a wheat line combining null alleles at each of the three wSSII loci, 
wSSIIA, wSSIIB and wSSIID, has been reported recently by Yamamori (1998). In this 
triple null line, the large starch granules were reported to be mostly deformed and a 
30 novel starch with high blue value was observed when stained with iodine, indicating 
that wSSII is a key enzyme for the synthesis of starch in wheat. Further analysis of the 
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starch derived from this triple null mutant is in progress. 

Mutations in starch synthases are known in three other species. In pea, mutation in 
SSII gives rise to starch with altered granule morphology and an amylopectin which 
5 yields an oligosaccharide distribution with reduced chain length on debranching, 
compared to the wild type (Craig et a/., 1998). A similar mutation in a gene designated 
SSII is known in Chlamydomonas (the ste-3 mutation) and similar effects on granule 
morphology and amylopectin structure are observed (Fontaine et a/., 1993). In maize, 
two mutations affecting starch synthases are known. First, the dulH mutation has been 

10 shown to be caused by a lesion within the cful SSIII-type starch synthase gene (Gao 
et a/., 1998). A second mutation, the sugary-2 mutation yields a starch with reduced 
amylopectin chain lengths on debranching (this mutation co-segregates with the SSIIa 
locus (Harn et a/., 1 998) although direct evidence that the sugary-2 mutation is caused 
by a lesion in the SSIIa gene is lacking). In the SSII mutants of each of these species, 

15 amylose biosynthesis capacity is retained, suggesting different roles in amylose and 
amylopectin synthesis for the GBSS and SSII genes. Given the conservation in overall 
organisation of the GBSS and SSII genes (see Figures 12 and 13), when an alignment 
is made based on the KTGGL motif of the first conserved region, this focuses attention 
on the role(s) of the N-terminal region in defining substrate specificity and the 

20 localisation of the proteins as the N-terminal region is the major area of divergence 
between the 4 classes of starch synthases. However, it is premature to exclude the 
influence of more subtle mutations in central and C-terminal regions of the gene. 

The cloning of the wSSII and wSSIII cDNAs and genomic clones described herein 
25 provides useful tools for the further study of the roles of the starch synthases in wheat. 
Firstly, they provide a source of markers which can be used to recover and combine 
null or divergent alleles. Secondly, genetic manipulation of wheat by gene suppression 
or over-expression can be carried out, and the genes may be used for over expression 
in other species. The promoter regions of these genes are also useful in regulating the 
30 expression of starch synthase genes and other heterologous genes in the developing 
wheat endosperm and in pre-anthesis florets of wheat. 
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TABLE 7 



Structural features of the wheat starch synthase III genomic gene 



Nucleotide Position 
in SEQ ID NO: 38 


Feature 


Length (bases) 


1-973 


5'-untranscribed region and 
promoter sequence 


973 


974-1099 


exon 1 


126 


1001-1003 


translation start codon (ATG) 


3 


1100-2056 


intron 1 


957 


2057-2120 


exon 2 


64 


2121 -2588 


intron 2 


468 


2589 - 5291 


exon 3 


2703 


5292 - 5549 


intron 3 


258 


5550 - 5767 


exon 4 


218 


5768-6103 


intron 4 


336 


6104-6374 


exon 5 


271 


6375-7148 


intron 5 


774 


7149 - 7324 


exon 6 


176 


7325 - 7438 


intron 6 


114 


7439 - 7546 


exon 7 


108 


7547 - 7792 


intron 7 


246 


7793 - 7902 


exon 8 


no 


7903-8797 


intron 8 


895 


8798 - 8900 


exon 9 


103 


8901 -9164 


intron 9 


264 


9165-9335 


exon 10 


171 


9336 - 9460 


intron 10 


125 


9461 - 9589 


exon 1 1 


129 


9590 - 9677 


intron 1 1 


88 
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9678 - 9860 


exon 12 


183 


9861 - 9977 


intron 1 2 


a a ~y 

117 


9978-10109 


exon 1 3 


132 


10110-10205 


intron 13 


96 


10206-10317 


exon 14 


112 


10318-10407 


•i A A 

intron 14 


90 


10408-10536 


exon 1 5 


129 


10537-10618 


intron 15 


82 


10619-11146 


exon 16 


128 


10744-10746 


translation stop codon (TGA) 


3 


11147-11611 


3'-untranscribed region 


465 
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CLAIMS: 

1 . An isolated nucleic acid molecule which comprises a sequence of nucleotides selected 
from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NO: 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(ii) the nucleotide sequence set forth in SEQ ID NO: 3 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(iii) the nucleotide sequence set forth in SEQ ID NO: 5 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(iv) the nucleotide sequence set forth in SEQ ID NO: 7 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(v) the nucleotide sequence set forth in SEQ ID NO: 9 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(vi) the nucleotide sequence set forth in SEQ ID NO: 1 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(vii) the nucleotide sequence set forth in SEQ ID NO: 12 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(viii) the nucleotide sequence set forth in SEQ ID NO: 13 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(ix) the nucleotide sequence set forth in SEQ ID NO: 14 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(x) the nucleotide sequence set forth in SEQ ID NO: 15 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xi) the nucleotide sequence set forth in SEQ ID NO: 16 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xii) the nucleotide sequence set forth in SEQ ID NO: 37 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xiii) the nucleotide sequence set forth in SEQ ID NO: 38 or the protein-encoding 
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region thereof or a degenerate nucleotide sequence thereto; 

(xiv) the nucleotide sequence set forth in SEQ ID NO: 1 1 or the protein-encoding 
region thereof or a degenerate nucleotide sequence thereto; 

(xv) a nucleotide sequence which encodes a wheat starch synthase polypeptide as 
hereinbefore defined wherein said nucleotide sequence has at least about 85% identity 
overall to any one of (i) to (xiv); and 

(xvi) a nucleotide sequence which is complementary to any one of (i) to (xv). 

2. The isolated nucleic acid molecule according to claim 1 wherein the wheat starch 
synthase polypeptide further comprises one or more amino acid sequences selected 
from the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 
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(o)AGADALLMPSRF(E/V)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

3. The isolated nucleic acid molecule according to claim 2 wherein the wheat starch 
synthase polypeptide comprises at least three of said amino acid sequences selected 
from the group consisting of (a) to (h). 

4. The isolated nucleic acid molecule according to claim 2 wherein the wheat starch 
synthase polypeptide comprises at least six of said amino acid sequences selected 
from the group consisting of (i) to (p). 

5. The isolated nucleic acid molecule according to claim 1 encoding a wheat starch 
synthase I! polypeptide. 

6. The isolated nucleic acid molecule according to claim 1 encoding a wheat starch 
synthase III polypeptide. 

7. An isolated nucleic acid molecule encoding a starch synthase polypeptide which 
comprises one or more amino acid sequences selected from the group consisting of: 

(a) GHTVEVILPKY; 

(b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

(e) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(f) TGGLVDTV; 

(g) GIVNGIDNMEWNPEVD; and 
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(h) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT. 

8. The isolated nucleic acid molecule of claim 5 encoding a wheat starch synthase II 
polypeptide which comprises an amino acid sequence selected from the group 
consisting of: 

(i) SEQIDNO:2; 

(ii) SEQIDNO:4; 

(iii) SEQ ID NO: 6; and 

(iv) a homologue of any one of (i) to (iii) having at least about 85% identity thereto. 

9. The isolated nucleic acid molecule of claim 6 encoding a wheat starch synthase III 
polypeptide which comprises an amino acid sequence selected from the group 
consisting of: 

(i) SEQ ID NO: 8; 

(ii) SEQ ID NO: 10; and 

(iii) a homologue of (i) or (ii) having at least about 85% identity thereto. 

1 0. A probe or primer comprising at least about 1 5 contiguous nucleotides in length derived 
from the nucleotide sequence according to claim 1 . 

1 1 . The probe or primer according to claim 1 0 comprising a nucleotide sequence selected 
from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NQ: 25; 

(ii) the nucleotide sequence set forth in SEQ ID NO: 26; 

(iii) the nucleotide sequence set forth in SEQ ID NO: 27; 
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(iv) the nucleotide sequence set forth in SEQ ID NO: 28; 

(v) the nucleotide sequence set forth in SEQ ID NO: 29; 

(vi) the nucleotide sequence set forth in SEQ ID NO: 30; 

(vii) the nucleotide sequence set forth in SEQ ID NO: 31 ; 

(viii) the nucleotide sequence set forth in SEQ ID NO: 32; 

(ix) the nucleotide sequence set forth in SEQ ID NO: 33; 

(x) the nucleotide sequence set forth in SEQ ID NO: 34; 

(xi) a nucleotide sequence which encodes an amino acid sequence selected from 
the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
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(p)VGG(V/L)RDTV; 

(xii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 37; 

(xiii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 38; and 

(xiv) a nucleotide sequence which is complementary to any one of (i) to (xiii). 

12. An isolated or recombinant polypeptide, protein or enzyme comprising an amino acid 
sequence selected from the following: 

(i) the amino acid sequence set forth in SEQ ID NO: 2 or the mature protein region 
thereof; 

(ii) the amino acid sequence set forth in SEQ ID NO: 4 or the mature protein region 
thereof; 

(Hi) the amino acid sequence set forth in SEQ ID NO: 6 or the mature protein region 
thereof; 

(iv) the amino acid sequence set forth in SEQ ID NO: 8 or the mature protein region 
thereof; 

(v) the amino acid sequence set forth in SEQ ID NO: 10 or the mature protein 
region thereof; 

(vi) a wheat starch synthase polypeptide having at least about 85% identity overall 
to any one of (i) to (v). 

1 3. The isolated or recombinant polypeptide according to claim 1 2 further comprising one 
or more amino acid sequences selected from the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 



WO 00/66745 



PCT/AUOO/00385 



- 88 - 

(d) GILNGIDPDIWDPYTD; 

(e) DVPIVGIITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVD7V; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(EA/)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV. 

1 4. The isolated or recombinant polypeptide according to claim 1 3 wherein the wheat starch 
synthase polypeptide comprises at least three of said amino acid sequences selected 
from the group consisting of (a) to (h). 

1 5. The isolated or recombinant polypeptide according to claim 1 3 wherein the wheat starch 
synthase polypeptide comprises at least six of said amino acid sequences selected 
from the group consisting of (i) to (p). 

1 6. The isolated or recombinant polypeptide according to claim 1 2 encoding a wheat starch 
synthase II polypeptide. 

I 
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1 7. The isolated or recombinant polypeptide according to claim 1 2 encoding a wheat starch 
synthase III polypeptide. 

1 8. An isolated or recombinant starch synthase polypeptide which comprises one or more 
amino acid sequences selected from the group consisting of: 

(a) GHTVEVILPKY; 

(b) HDWSSAPVAWLYKEHY; 

(c) DVPIVGIITRLTAQKG; 

(d) NGQWLLGSA; 

(e) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(f) TGGLVDTV; 

(g) GIVNGIDNMEWNPEVD; and 

(h) AGADALLMPSRF(EA/)PCGLNQLYAMAYGT. 

19. The isolated or recombinant polypeptide according to claim 16 consisting of a wheat 
starch synthase II polypeptide which comprises an amino acid sequence selected from 
the group consisting of: 

(i) SEQ ID NO: 2; 

(ii) SEQ ID NO: 4; 

(iii) SEQ ID NO: 6; and 

(iv) a homologue of any one of (i) to (iii) having at least about 85% identity thereto. 



20. 



The isolated or recombinant polypeptide according to claim 17 consisting of a wheat 
starch synthase III polypeptide which comprises an amino acid sequence selected from 
the group consisting of: 
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(i) SEQ ID NO: 8; 

(ii) SEQ ID NO: 10; and 

(iii) a homologue of (i) or (ii) having at least about 85% identity thereto. 

21. The isolated or recombinant polypeptide according to claim 12 substantially free of 
conspecific or non-specific proteins. 

22. A method comprising: 

(i) hybridising single-stranded or double-stranded mRNA, cDNA or genomic DNA 
with a nucleotide sequence selected from the group consisting of: 

(a) the nucleotide sequence according to any one of claims 1 to 9; 

(b) a probe or primer derived from a nucleotide sequence according to sub- 
paragraph (a) and comprising at least about 1 5 contiguous nucleotides of said 
nucleotide sequence in length; and 

(ii) detecting the hybridised mRNA, cDNA or genomic DNA using a detecting 
means. 

23. The method according to claim 22 wherein the detecting means consists of a reporter 
molecule covalently attached to the probe or primer molecule. 

24. The method according to claim 22 wherein the detecting means consists of a 
polymerase chain reaction. 

25. The method according to claim 22 wherein the probe or primer comprises a nucleotide 
sequence selected from the group consisting of: 

(i) the nucleotide sequence set forth in SEQ ID NO: 25; 
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}Ua rn ir*lor*tirlo eoni icnro cot fnrth 

ine nucicuuuc bcLjucimti oci luiui 


in SFO ID NO* 26* 


(III) 


»Uq ni ir»lof^tiHo coni lonro cot fnrth 


in SEQ ID NO* 27' 


(IV) 


}ko ni A^ttHo co/ii lonro cot fir\rtH 

ine nucieoiiut; otsqutJiioo s>ci lurui 


in SEO ID NO* 28' 


(v) 


lUn ni irlaAtiHa cam icnra cot fr\rth 

ine nucieoiiQe sequence sei luiui 


in ^FO ID NO* 29* 

III OLW IL/ INV-/. 4?| 


(VI) 


trie nucieouoe sequence sei ionn 


in qfh in wo- in* 


(vii) 


the nucleotide sequence set forth 


in SEQ ID NO: 31; 


(viii) 


the nucleotide sequence set forth 


in SEQ ID NO: 32; 


(ix) 


the nucleotide sequence set forth 


in SEQ ID NO: 33; 


(x) 


the nucleotide sequence set forth 


in SEQ ID NO: 34; 


(xi) 


a nucleotide sequence which encodes an amino acid sequence selected from 



the group consisting of: 

(a) KVGGLGDWTS; 

(b) GHTVEVILPKY; 

(c) HDWSSAPVAWLYKEHY; 

(d) GILNGIDPDIWDPYTD; 

(e) D VP I VG I ITRLTAQKG; 

(f) NGQWLLGSA; 

(g) AGSDFIIVPSIFEPCGLTQLVAMRYGS; 

(h) TGGLVDTV; 

(i) KTGGLGDVAGA; 
(j) GHRVMVWPRY; 

(k) NDWHTALLPVYLKAYY; 
(I) GIVNGIDNMEWNPEVD; 
(m) DVPLLGFIGRLDGQKG; 
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(n) DVQLVMLGTG; 

(o)AGADALLMPSRF(E/V)PCGLNQLYAMAYGT; and 
(p)VGG(V/L)RDTV; 

(xii) a nucleotide sequence comprising at least about 15 contiguous nucleotides of 
an intron region of SEQ ID NO: 37; 

(xiii) a nucleotide sequence comprising at least about 1 5 contiguous nucleotides of 
an intron region of SEQ ID NO: 38; and 

(xiv) a nucleotide sequence which is complementary to any one of (i) to (xiii). 

26. A method of assaying for the presence or absence of a wheat starch synthase 
polypeptide in a plant or a plant extract or isolated nucleic acid sample, said method at 
least comprising performing the method according to any one of claims 22 to 25. 

27. The method according to claim 26 further comprising preparing the plant extract or 
nucleic acid sample. 

28. A method of marker-assisted breeding and/or selection of a plant at least comprising 
performing the method according to any one of claims 22 to 25. 

29. The method according to claim 28 further comprising selecting a plant which expresses 
a desirable wheat starch synthase characteristic. 

30. The method according to claim 28 further comprising crossing a plant which expresses 
a desirable wheat starch synthase characteristic to another plant. 

31. The method according to claim 30 further comprising selecting progeny of the cross 
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which expresses a desirable wheat starch synthase characteristic. 

32. A plant produced by the method according to any one of claims 28 to 31 wherein said 
plant expresses a wheat starch synthase polypeptide at a desired level detectable using 
said method. 

33. A method of modifying the starch content and/or starch composition of one or more 
tissues or organs of a plant, comprising expressing in said plant a nucleic acid molecule 
for a time and under conditions sufficient for the enzyme activity of one or more starch 
synthase isoenzymes to be modified, wherein said nucleic acid molecule is selected 
from the group consisting of: 

(i) the isolated nucleic acid molecule according to any one of claims 1 to 9; 

(ii) a fragment of (i) which comprises a nucleotide sequence capable of being 
expressed to down-regulate the expression of an endogenous wheat starch synthase 
isoenzyme of said plant; and 

(iii) a fragment of (i) which encodes a functional wheat starch synthase isoenzyme 
of said plant. 

34. The method according to claim 33 wherein the fragment at sub-paragraph (ii) is an 
antisense molecule, ribozyme molecule, co-suppression molecule, or gene-targeting 
molecule. 

35. The -method according to claim 33 further comprising introducing the nucleic acid 
molecule to an isolated plant cell, tissue, organ, or organelle. 

36. The method according to claim 35 further comprising regenerating an intact plant from 
the isolated plant cell, tissue, organ, or organelle carrying the introduced nucleic acid 
molecule. 
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37. The method according to claim 35 wherein the nucleic acid molecule is introduced to 
the plant cell, tissue, organ, or organelle by introgression. 

38. The method according to claim 35 wherein the nucleic acid molecule is introduced to 
the plant cell, tissue, organ, or organelle by transformation means. 

39. An isolated promoter sequence comprising a nucleotide sequence selected from the 
group consisting of: 

(i) nucleotides 1 to about 287 of SEQ ID NO: 1 1 ; 

(ii) nucleotides 1 to about 1416 of SEQ ID NO: 37; 

(iii) nucleotides 1 to about 973 of SEQ ID NO: 38; 

(iv) a fragment of any one of (i) to (iii) capable of conferring expression on a 
heterologous gene in a monocotyledonous plant cell, tissue or organ; and 

(v) a complementary nucleotide sequence to any one of (i) to (iv). 

40. The isolated promoter sequence according to claim 39 that is operable in the 
endosperm. 

41 . A plant carrying the isolated nucleic acid molecule according to any one of claims 1 to 
9 as an exogenous complement to its genome. 

42. A progeny of the plant according to claim 41 wherein said progeny carries the 
introduced nucleic acid molecule. 

43. A propagule of the plant according to claim 41 or 42 wherein said propagule carries the 
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introduced nucleic acid molecule present in said plant. 



44. A gene construct or vector which comprises the isolated nucleic acid molecule 
according to any one of claims 1 to 9 and one or more origins of replication. 



45. The gene construct according to claim 44 further comprising a promoter 
sequence in operable connection with said isolated nucleic acid molecule. 



46. A gene construct or vector which comprises the probe or primer according to 
claim 10 or 1 1 and one or more origins of replication. 



47. A modified starch derived from the plant according to claim 32 or 41 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said plant 



48. A modified starch derived from the progeny according to claim 42 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said progeny. 



49. A modified starch derived from the propagule according to claim 43 wherein said starch 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said propagule. 



50. A food product comprising the modified starch according to any one of claims 47 to 49. 



51. 



The food product according to claim 50 consisting of flour or a flour-based food product 
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52. The food product according to claim 50 or 51 selected from the group consisting of: 
flour-based sauce; leavened bread; unleavened bread; pasta, noodle; cereal; snack 
food; cake; and pastry. 



53. Use of the modified starch according to any one of claims 47 to 49 in the preparation 
of a food product for consumption by an animal or human. 



54. A modified protein derived from the plant according to claim 32 or 41 wherein said 
protein is modified by virtue of the use of the isolated nucleic acid according to claim 1 
to produce said plant. 



55. A modified protein derived from the progeny according to claim 42 wherein said protein 
is modified by virtue of the use of the isolated nucleic acid according to claim 1 to 
produce said progeny. 



56. A modified protein derived from the propagule according to claim 43 wherein said 
protein is modified by virtue of the use of the isolated nucleic acid according to claim 1 
to produce said propagule. 



57. A non-food product comprising the modified protein according to any one of claims 54 
to 56. 



58. The non-food product according to claim 57 selected from the group consisting of: films; 
coatings; adhesives; building materials; and packaging materials. 



59. Use of the modified protein according to any one of claims 54 to 56 in the preparation 
of a non-food product. 
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SEQUENCE LISTING 

<110> COMMONWEALTH SCIENTIFIC AND INDUSTRIAL RESEARCH ORGANISATION 
GOODMAN FIELDER LIMITED 
GROUPE LIMAGRAIN PACIFIC PTY LTD 

<120> NOVEL GENES ENCODING WHEAT STARCH SYNTHASES AND USES 
THEREFOR 

<130> p:\oper\mro\pi-wss.pct 

<140> TO BE ADVISED 
<141> 2000-04-28 

<150> AU PQ0052/99 
<151> 1999-04-29 

<160> 54 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 2939 
<212> DNA 

<213> Triticum aestivum 

<220> 
<221> CDS 

<222> (176) (2569) 
<400> 1. 

atttcctcgg cctgaccccg tgcgtttacc ccacacagag cacactccag tccagtccag 60 

cccactgccg cgctactccc cactcccact gccaccacct ccgcctgcgc cgcgctctgg 120 

gcggaccaac ccgcgcatcg tatcacgatc acccaccccg atcccggccg ccgcc atg 178 

Met 
1 

teg teg gcg gtc gcg tec gee gcg tec ttc etc gcg etc gcg tec gee 226 
Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser Ala 
5 10 15 

tec ccc ggg aga tea egg agg agg acg agg gtg age gcg teg cca ccc 274 
Ser Pro Gly Arg Ser Arg Arg Arg Thr Arg Val Ser Ala Ser Pro Pro 
20 25 30 

cac ace ggg get ggc agg ttg cac tgg ccg ccg teg ccg ccg cag cgc 322 
His Thr Gly Ala Gly Arg Leu His Trp Pro Pro Ser Pro Pro Gin Arg 
35 40 45 

acg get cgc gac gga gcg gtg gee gcg cgc gee gec ggg aag aag gac 370 
Thr Ala Arg Asp Gly Ala Val Ala Ala Arg Ala Ala Gly Lys Lys Asp 
50 55 60 65 

gcg ggg ate gac gac gec gcg ccc gcg agg cag ccc cgc gca etc cgc 418 
Ala Gly He Asp Asp Ala Ala Pro Ala Arg Gin Pro Arg Ala Leu Arg 
70 75 80 

ggt ggc gee gee acc aag gtt gcg gag egg agg gat ccc gtc aag acg 4 66 
Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val Lys Thr 
85 90 95 

etc gat cgc gac gee gcg gaa ggt ggc gcg ccg tec ccg ccg gca ccg 514 
Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ser Pro Pro Ala Pro 
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100 105 110 

agg cag gag gac gcc cgt ctg ccg age atg aac ggc atg ccg gtg aac 562 
Arg Gin Glu Asp Ala Arg Leu Pro Ser Met Asn Gly Met Pro Val Asn 
.115 120 125 

ggt gaa aac aaa tct acc ggc ggc ggc ggc gcg act aaa gac age ggg 610 
Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp Ser Gly 
130 135 140 145 

ctg ccc gca ccc gca cgc gcg ccc cag ccg teg age cag aac aga gta 658 
Leu Pro Ala Pro Ala Arg Ala Pro Gin Pro Ser Ser Gin Asn Arg Val 
150 155 160 

ccg gtg aat ggt gaa aac aaa get aac gtc gcc teg ccg ccg acg age 706 
Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro Thr Ser 
165 170 175 

ata gcc gag gtc gcg get ccg gat ccc gca get acc att tec ate agt 754 
He Ala Glu Val Ala Ala Pro Asp Pro Ala Ala Thr He Ser He Ser 
180 185 190 

gac aag gcg cca gag tec gtt gtc cca gcc gag aag gcg ccg ccg teg 802 
Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Ala Pro Pro Ser 
195 200 205 

tec ggc tea aat ttc gtg ccc teg get tct get ccc ggg tct gac act 850 
Ser Gly Ser Asn Phe Val Pro Ser Ala Ser Ala Pro Gly Ser Asp Thr 
210 215 220 225 

gtc age gac gtg gaa ctt gaa ctg aag aag ggt gcg gtc att gtc aaa 898 
Val Ser Asp Val Glu Leu Glu Leu Lys Lys Gly Ala Val He Val Lys 
230 235 240 

gaa get cca aac cca aag get ctt teg ccg ccc gca gca ccc get gta 94 6 
Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro Ala Val 
245 250 255 

caa caa gac ctt tgg gac ttc aag aaa tac att ggt ttc gag gag ccc 994 
Gin Gin Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu Glu Pro 
260 265 270 

gtg gag gcc aag gat gat ggc egg get gtt gca gat gat gcg ggc tec 1042 
Val Glu Ala Lys Asp Asp Gly Arg Ala Val Ala Asp Asp Ala Gly Ser 
275 280 285 

ttc gaa cac cac cag aat cac gat tec ggg cct ttg gca ggg gag aac 1090 
Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly Glu Asn 
290 295 300 305 

gtc atg aac gtg gtc gtc gtg get get gaa tgt tct ccc tgg tgc aaa 1138 
Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp Cys Lys 
310 315 320 

aca ggt ggt ctt gga gat gtt gcc ggt get ttg ccc aag get ttg gcg 1186 
Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala Leu Ala 
325 330 335 

aag aga gga cat cgt gtt atg gtt gtg gta cca agg tat ggg gac tat 1234 
Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly Asp Tyr 
340 345 350 

gag gaa gcc tac gat gtc gga gtc cga aaa tac tac aag get get gga 1282 
Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly 
355 360 365 
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cag tac act egg tec att atg gtg ata cat aac ate get cac cag ggc 
Gin Tyr Thr Arg Ser lie Met Val He His Asn He Ala His Gin Gly 
470 475 480 



cac gac ate ata egg cag aac gac tgg aag ace cgc ggc ate gtg aac 
His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly He Val Asn 
550 555 560 



ggc gac gtg ccg ctg etc ggc ttc ate ggg cgc ctg gac ggg cag aag 
Gly Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly Gin Lys 
610 615 620 625 



1378 



cag gat atg gaa gtg aat tat ttc cat get tat ate gat gga gtt gat 1330 
Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly Val Asp 
370 375 380 385 

ttt gtg ttc att gac get cct etc ttc cga cac cgc cag gaa gac att 
Phe Val Phe He Asp Ala Pro Leu Phe Arg His Arg Gin Glu Asp lie 
390 395 400 

tat ggg ggc age aga cag gaa att atg aag cgc atg att ttg ttc tgc 1426 
Tyr Gly Gly Ser Arg Gin Glu He Met Lys Arg Met He Leu Phe Cys 
405 410 415 

aag gee get gtc gag gtt cca tgg cac gtt cca tgc ggc ggt gtc cct 1474 
Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly Val Pro 
420 425 430 

tat ggg gat gga aat ctg gtg ttt att gca aat gat tgg cac acg gca 1522 
Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His Thr Ala 
435 440 445 

etc ctg cct gtc tat ctg aaa gca tat tac agg gac cat ggt ttg atg 1570 
Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly Leu Met 
450 455 460 465 



1618 



cgt ggc cca gta gat gag ttc ccg ttc ace gag ttg cct gag cac tac 1666 
Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu His Tyr 
485 490 495 

ctg gaa cac ttc aga ctg tac gac ccc gtg ggt ggt gaa cac gee aac 1714 
Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His Ala Asn 
500 505 510 

tac ttc gee gee ggc ctg aag atg gcg gac cag gtt gtc gtc gtg age 1762 
Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val Val Ser 
515 520 525 

ccg ggg tac ctg tgg gag ctg aag acg gtg gag ggc ggc tgg ggg ctt 1810 
Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp Gly Leu 
530 535 540 545 



1858 



ggc ate gac aac atg gag tgg aac ccc gag gtg gac gtc cac etc aag 1906 
Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His Leu Lys 
565 570 575 

teg gac ggc tac ace aac ttc tec ctg ggg acg ctg gac tec ggc aag 1954 
Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys 
580 585 590 

egg cag tgc aag gag gee ctg cag egg gag ctg ggc ctg cag gtc cgc 2002 
Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin Val Arg 
595 600 605 



2050 
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ggc gtg gag ate ate gcg gac gcg atg ccc tgg ate gtg age cag gac 
Gly Val Glu lie lie Ala Asp Ala Met Pro Trp He Val Ser Gin Asp 
630 635 640 



2098 



gtg cag ctg gtc atg ctg ggc acc ggg cgc cac gac ctg gag ggc atg 2146 
Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu Gly Met 
645 650 655 

ctg egg cac ttc gag egg gag cac cac gac aag gtg cgc ggg tgg gtg 2194 
Leu Arg His Phe Glu Arg Glu His His Asp Lys val Arg Gly Trp Val 
660 665 670 

ggg ttc tec gtg egg ctg gcg cac egg ate acg gee ggc gee gac gcg 2242 
Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala Asp Ala 
675 680 685 

etc etc atg ccc tec egg ttc gag ccg tgc gga ctg aac cag etc tac 2290 
Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr 
690 » 695 700 705 

gee atg gee tac ggc acc gtc ccc gtc gtg cat gee gtc ggt ggc ctg 2338 
Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly Leu 
710 715 720 

agg gac acc gtg ccg ccg ttc gac ccc ttc aac cac tec ggg etc ggg 2386 
Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His. Ser Gly Leu Gly 
725 730 735 

tgg acg ttc gac cgc gca gag gcg cag aag ctg ate gag gcg etc ggg 24 34 
Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu He Glu Ala Leu Gly 
740 " 745 750 

cac tgc etc cgc acc tac egg gac tac aag gag age tgg agg ggg etc 2482 
His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly Leu 
755 760 765 

cag gag cgc ggc atg teg cag gac ttc age tgg gag cat gee gee aag 2530 
Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala Lys 
770 775 780 785 

etc tac gag gac gtc etc gtc aag gee aag tac cag tgg tgaaegctag 2579 
Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 





790 




795 








ctgctagccg 


gtccagcccc 


gcatgcgtgc 


atgacaggat 


ggaattgege 


attgegcacg 


2639 


caggaaggtg 


ccatggagcg 


ccggcatccg 


cgaagtacag 


tgacatgagg 


tgtgtgtggt 


2699 


tgagacgctg 


attccgatct 


ggtcegtage 


agagtagagc 


ggaggtaggg 


aagcgctcct 


2759 


tgttacaggt 


atatgggaat 


gttgttaact 


tggtattgta 


atttgttatg 


ttgtgtgcat 


2819 


tattacagag 


ggcaacgatc 


tgcgccggcg 


caccggccca 


actgttgggc 


cggtcgcaca 


2879 


gcagccgttg 


gatccgaccg 


cctgggccgt 


tggatcccac 


cgaaaaaaaa 


aaaaaaaaaa 


2939 



<210> 2 
<211> 798 
<212> PRT 

<213> Triticum aestivum 
<400> 2 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
1 5-10 15 
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Ala Ser Pro Gly Arg Ser Arg Arg Arg Thr Arg Val Ser Ala Ser Pro 
20 25 30 

Pro His Thr Gly Ala Gly Arg Leu His Trp Pro Pro Ser Pro Pro Gin 
35 40 45 

Arg Thr Ala Arg Asp Gly Ala Val Ala Ala Arg Ala Ala Gly Lys Lys 
50 55 60 

Asp Ala Gly He Asp Asp Ala Ala Pro Ala Arg Gin Pro Arg Ala Leu 
65 70 75 • 80 

Arg Gly Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val Lys 
"85 90 95 

Thr Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ser Pro Pro Ala 
100 .105 HO 

Pro Arg Gin Glu Asp Ala Arg Leu Pro Ser Met Asn Gly Met Pro Val 
115 120 125 

Asn Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp Ser 
130 135 140 

Gly Leu Pro Ala Pro Ala Arg Ala Pro Gin Pro Ser Ser Gin Asn Arg 
145 150 155 160 

Val Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro Thr 
165 170 175 

Ser He Ala Glu Val Ala Ala Pro Asp Pro Ala Ala Thr He Ser lie 
180 185 190 

Ser Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Ala Pro Pro 
195 200 205 

Ser Ser Gly Ser Asn Phe Val Pro Ser Ala Ser Ala Pro Gly Ser Asp 
210 215 220 

Thr Val Ser Asp Val Glu Leu Glu Leu Lys Lys Gly Ala Val He Val 
225 230 235 240 

Lys Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro Ala 
245 250 255 

Val Gin Gin Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu Glu 
260 265 270 

Pro Val Glu Ala Lvs Asp Asp Gly Arg Ala Val Ala Asp Asp Ala Gly 
275 280 285 

Ser Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly Glu 
290 295 . 300 

Asn Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp Cys 
305 .310 315 320 

Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala Leu 
325 330 335 

Ala Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly Asp 
340 345 350 



Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala Ala 
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355 



360 



365 



Gly Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly Val 
370 375 380 

Asp Phe Val Phe He Asp Ala Pro Leu Phe Arg His Arg Gin Glu Asp 
385 390 395 400 

He Tyr Gly Gly Ser Arg Gin Glu He Met Lys Arg Met He Leu Phe 
405 410 415 

Cys Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly Val 
420. 425 430 

Pro Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His Thr 
435 440 445 

Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly Leu 
450 455 460 

Met Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His Gin 
465 " 470 475 480 

Gly Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu. Pro Glu His 
485 490 495 

Tyr Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His Ala 
500 505 510 

Asn Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp Gin Val Val Val Val 
515 520 525 

Ser Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp Gly 
530 535 540 

Leu His Asp He lie Arg Gin Asn Asp Trp Lys Thr Arg Gly He Val 
545 550 555 560 

Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His Leu 
565 570 575 

Lys Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser Gly 
580 585 590 

Lys Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin Val 
595 600 605 

Arg Gly Asp Va 1 Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly Gin 
610 615 620 

Lys Gly Val Glu He He Ala Asp Ala Met Pro Trp He Val Ser Gin 
625 630 635 640 

Asp Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu Gly 
645 650 655 

Met Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly Trp 
660 665. 670 

Val Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala Asp 
675 680 685 



Ala Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin Leu 
690 695 700 
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Tyr Ala Met Ala Tyr Gly Thr Val Pro Val Val His Ala Val Gly Gly 

705 710 715 720 

Leu Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly Leu 

725 730 735 

Gly Trp Thr Phe Asp Arg Ala Glu Ala Gin Lys Leu He Glu Ala Leu 

740 745 750 

Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg Gly 

755 760 765 

Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala Ala 

770 ~ 775 780 

Lys Leu Tyr Glu Asp Val Leu Val Lys Ala Lys Tyr Gin Trp 

785 790 795 



<210> 3 
<211> 2842 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (89).. (2485) 

<400> 3 

gctgccacca cctccgcctg cgccgcgctc tgggcggagg accaacccgc gcatcgtacc 60 

atcgcccgcc ccgatcccgg ccgccgcc atg teg teg gcg gtc gcg tec gee 112 

Met Ser Ser Ala Val Ala Ser Ala 
1 5 



gcg tec ttc etc gcg etc gee tec gee tec ccc ggg aga tea cgc agg 

Ala Ser Phe Leu Ala Leu Ala Ser Ala Ser Pro Gly Arg Ser Arg Arg 

10 15 20 

egg gcg agg gtg age gcg ccg cca ccc cac gee ggg gee ggc agg ctg 

Arg Ala Arg Val Ser Ala Pro Pro Pro His Ala Gly Ala Gly Arg Leu 

25 30 35 40 



160 



208 



cac tgg ccg ccg tgg ccg ccg cag cgc acg get cgc gac gga ggt gtg 256 
His Trp Pro Pro Trp Pro Pro Gin Arg Thr Ala Arg Asp Gly Gly Val 
45 50 55 

gec gcg cgc gec gee ggg aag aag gac gcg agg gtc gac gac gac gec 304 
Ala Ala Arg Ala Ala Gly Lys Lys Asp Ala Arg Va\ Asp Asp Asp Ala 
60 65 70 

gcg tec gcg agg cag ccc cgc gca cgc cgc ggt ggc gee gee ace aag 
Ala Ser Ala Arg Gin Pro Arg Ala Arg Arg Gly Gly Ala Ala Thr Lys 
75 80 85 

gtc gcg gag egg agg gat ccc gtc aag acg etc gat cgc gac gee gcg 
Val Ala Glu Arg Arg Asp Pro Val Lys Thr Leu Asp Arg Asp Ala Ala 
90 95 100 

gaa ggt ggc gcg ccg gca ccg ccg gca ccg agg cag gac gee gee cgt 448 
Glu Gly Gly Ala Pro Ala Pro Pro Ala Pro Arg Gin Asp Ala Ala Arg 
105 110 115 120 

cca ccg agt atg aac ggc acg ccg gtg aac ggt gag aac aaa tct acc 496 
Pro Pro Ser Met Asn Gly Thr Pro Val Asn Gly Glu Asn Lys Ser Thr 



352 



400 
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125 130 135 

ggc ggc ggc ggc gcg acc aaa gac age ggg ctg ccc gca ccc gca cgc 544 
Gly Gly Gly Gly Ala Thr Lys Asp Ser Gly Leu Pro Ala Pro Ala Arg 
140 145 150 

gcg ccc cat ccg teg acc cag aac aga gta cca gtg aac ggt gaa aac 592 
Ala Pro His Pro Ser Thr Gin Asn Arg Val Pro Val Asn Gly Glu Asn 
155 160 165 

aaa get aac gtc gee teg ccg ccg acg age ata gee gag gtc gtg get 
Lys Ala Asn Val Ala Ser Pro Pro Thr Ser lie Ala Glu Val Val Ala 
170 175 180 

ccg gat tec gca get acc att tec ate agt gac aag gcg ccg gag tec 
Pro Asp Ser Ala Ala Thr He Ser He Ser Asp Lys Ala Pro Glu Ser 
185 190 195 200 

gtt gtc cca gee gag aag ccg ccg ccg teg tec ggc tea aat ttc gtg 736 
Val Val Pro Ala Glu Lys Pro Pro Pro Ser Ser Gly Ser Asn Phe Val 
205 * 210 215 

gtc teg get tct get ccc agg ctg gac att gac age gat gtt gaa cct 784 
Val Ser Ala Ser Ala Pro Arg Leu Asp He Asp Ser Asp Val Glu Pro 
220 225 230 

gaa ctg aag aag ggt gcg gtc ate gtc gaa gaa get cca aac cca aag 
Glu Leu Lys Lys Gly Ala Val He Val Glu Glu Ala Pro Asn Pro Lys 
235 240 245 

get ctt teg ccg cct gca gee ccc get gta caa gaa gac ctt tgg gac 
Ala Leu Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp 
250 255 260 

ttc aag aaa tac att ggc ttc gag gag ccc gtg gag gee aag gat gat 
Phe Lys Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp 
265 270 275 280 

ggc tgg get gtt gca gat gat gcg ggc tec ttt gaa cat cac cag aac 
Gly Trp Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn 
285 290 295 

cat gat tec gga cct ttg gca ggg gag aac gtc atg aac gtg gtc gtc 
His Asp Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val 
300 305 .310 

gtg get get gaa tgt tct ccc tgg tgc aaa aca ggt ggt ctt gga gat 
Val Ala Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp 
315 320 325 

gtt gec ggt get ttg ccc aag get ttg gcg aag aga gga cat cgt gtt 
Val Ala Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val 
330 335 340 

atg gtt gtg gta cca agg tat ggg gac tat gag gaa gee tac gat gtc 
Met Val Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Ala Tyr Asp Val 
345 350 355 360 

gga gtc cga aaa tac tac aag get get gga cag gat atg gaa gtg aat 1216 
Gly Val Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn 
365 " 370 375 



832 



880 



928 



976 



1024 



1072 



1120 



1168 



tat ttc cat get tat ate gat gga gtt gat ttt gtg ttc att gac get 
Tyr Phe His Ala Tyr He Asp Gly Val Asp Phe Val Phe lie Asp Ala 
380 385 390 



1264 
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cct etc ttc cga cac cgc cag gaa gac att tat ggg ggc age aga cag 1312 
Pro Leu Phe Arg His Arg Gin Glu Asp lie Tyr Gly Gly Ser Arg Gin 
395 400 405 

gaa att atg aag cgc atg att ttg ttc tgc aag gec get gtc gag gtt 1360 
Glu He Met Lys Arg Met lie Leu Phe Cys Lys Ala Ala Val Glu Val 
410 415 420 

cct tgg cac gtt cca tgc ggc ggt gtc cct tat ggg gat gga aat ctg 1408 
Pro Trp His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu 
425 430 435 440 . 

gtg ttt att gca aat gat tgg cac acg gca etc ctg cct gtc tat ctg 1456 
Val Phe He Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu 
445 450 455 

aaa gca tat tac agg gac cat ggt ttg atg cag tac act egg tec att 1504 
Lys Ala Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser He 
• 460 465 470 

atg gtg ata cat aac ate gcg cac cag ggc cgt ggc cca gta gat gaa 1552 
Met Val He His Asn He Ala His Gin Gly Arg Gly Pro val Asp Glu 
475 480 485 

ttc ccg ttc ace gag ttg cct gag cac tac ctg gaa cac ttc aga ctg 1600 
Phe Pro Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu 
490 495 500 

tac gac ccc gtg ggt ggt gag cac gee aac tac ttc gec gec ggc ctg 1648 
Tyr Asp Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu 
505 510 515 520 

aag atg gcg gac cag gtt gtc gtg gtg age ccc ggg tac ctg tgg gag 1696 
Lys Met Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu 
525 530 535 

etc aag acg gtg gag ggc ggc tgg ggg ctt cac gac ate ata egg cag 1744 
Leu Lys Thr Val Glu Gly Gly Trp Gly Leu His Asp He He Arg Gin 
540 545 550 

aac gac tgg aag acc cgc ggc ate gtc aac ggc ate gac aac atg gag 1792 
Asn Asp Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu 
555 560 565 

tgg aac ccc gag gtg gac gtc cac etc aag teg gac ggc tac acc aac 1840 
Trp Asn Pro Glu Val Asp Val His Leu Lys Ser Asp Gly Tyr Thr Asn 
570 575 580 

ttc tec ctg ggg acg ctg gac tec ggc aag egg cag tgc aag gag gee 1888 
Phe Ser Leu Gly Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala 
585 590 595 600 

ctg cag cgc gag ctg ggc ctg cag gtc cgc gec gac gtg ccg ctg etc 1936 
Leu Gin Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu 
605 610 615 

ggc ttc ate ggc cgc ctg gac ggg cag aag ggc gtg gag ate ate gcg 1984 
Gly Phe He Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala 
620 625 630 

gac gec atg ccc tgg ate gtg age cag gac gtg cag ctg gtc atg ctg 2032 
Asp Ala Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu 
635 640 645 
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ggc acc ggc cgc cac gac ctg gag age atg ctg egg cac ttc gag egg 
Gly Thr Gly Arg His Asp Leu Glu Ser Met Leu Arg His Phe Glu Arg 
650 655 660 



2080 



gag cac cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg 2128 
Glu His His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu 
665 670 675 680 

gcg cac egg ate acg gcg ggc gec gac gcg etc etc atg ccc tec egg 2176 
Ala His Arg He Thr. Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg 
685 690 695 



ttc gag ccg tgc ggg ttg aac cag ctt tac gee atg gee tac ggc acc 
Phe Glu Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr 
700 705 710 



2224 



gtc ccc gtc gtg cac gec gtc ggc ggg gtg agg gac acc gtg ccg ccg 2272 
val Pro Val Val His Ala Val Gly Gly Val Arg Asp Thr Val Pro Pro 
715 720 725 



ttc gac ccc ttc aac cac tec ggc etc ggg tgg acg ttc gac cgc gee 
Phe Asp Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala 
730 735 740 

gag gcg cac aag ctg ate gag gcg etc ggg cac tgc etc cgc acc tac 
Glu Ala His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg Thr Tyr 
745 750 755 760 



2320 



2368 



egg gac tac aag gag age tgg agg ggc etc cag gag cgc ggc atg teg 2416 
Arq Asp Tyr Lys Glu Ser Trp Arg Gly Leu Gin Glu Arg Gly Met Ser 
7 65 770 775. 



cag gac ttc age tgg gag cat gee gee aag etc tac gag gac gtc etc 
Gin Asp Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu 
780 785 790 



2464 



etc aag gee aag tac cag tgg tgaaegctag ctgctagccg ctccagcccc 2515 
Leu Lys Ala Lys Tyr Gin Trp 
795 

gcatgcgtgc atgeatgaga gggtggaact gcgcattgcg cccgcaggaa cgtgccatcc 2575 
ttctcgatgg gagcgccggc atecgegagg tgcagtgaca tgagaggtgt gtgtggttga 2635 
gaegctgatt ccgatctcga tctggtccgt agcagagtag ageggaegta gggaageget 2695 
ccttgttgca ggtatatggg aatgttgtca acttggtatt gtagtttgct atgttgtatg 2755 
cgttattaca atgttgttac ttattcttgt taagteggag geaaagggeg aaagctagct 2815 
cacatgaaaa aaaaaaaaaa aaaaaaa 28 42 



<210> 4 
<211> 799 
<212> PRT 

<213> Triticum aestivum 
<400> 4 

Met Ser Ser Ala Val Ala Ser Ala Ala Ser Phe Leu Ala Leu Ala Ser 
1 5 10 15 

Ala Ser Pro Gly Arg Ser Arg Arg Arg Ala Arg Val Ser Ala Pro Pro 
20 25 30 
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Pro His Ala Gly Ala Gly Arg Leu His Trp Pro Pro Trp Pro Pro Gin 
35 40 45 

Arg Thr Ala Arg Asp Gly Gly Val Ala Ala Arg Ala Ala Gly Lys Lys 
50 55 60 

Asp Ala Arg Val Asp Asp Asp Ala Ala Ser Ala Arg Gin Pro Arg Ala 
65 70 75 80 

Arg Arg Gly 'Gly Ala Ala Thr Lys Val Ala Glu Arg Arg Asp Pro Val 
85 90 95 

Lys Thr Leu Asp Arg Asp Ala Ala Glu Gly Gly Ala Pro Ala Pro Pro 
100 105 110 

Ala Pro Arg Gin Asp Ala Ala Arg Pro Pro Ser Met Asn Gly Thr Pro 
. 115 120 125 

Val Asn Gly Glu Asn Lys Ser Thr Gly Gly Gly Gly Ala Thr Lys Asp 
130 135 140 

Ser Gly Leu Pro Ala Pro Ala Arg Ala Pro His Pro Ser Thr Gin Asn 
145 150 155 160 

Arg Val Pro Val Asn Gly Glu Asn Lys Ala Asn Val Ala Ser Pro Pro 
165 170 175 

Thr Ser He Ala Glu Val Val Ala Pro Asp Ser Ala Ala Thr He Ser 
180 185 190 

lie Ser Asp Lys Ala Pro Glu Ser Val Val Pro Ala Glu Lys Pro Pro 
195 200 205 

Pro Ser Ser Gly Ser Asn Phe Val Val Ser Ala Ser Ala Pro Arg Leu 
210 215 220 

Asp He Asp Ser Asp Val Glu Pro Glu Leu Lys Lys Gly Ala Val He 
225 ' 230 235 240 

Val Glu Glu Ala Pro Asn Pro Lys Ala Leu Ser Pro Pro Ala Ala Pro 
245 250 255 

Ala Val Gin Glu Asp Leu Trp Asp Phe Lys Lys Tyr He Gly Phe Glu 
260 265 270 

Glu Pro Val Glu Ala Lys Asp Asp Gly Trp Ala Val Ala Asp Asp Ala 
275 280 285 

Gly Ser Phe Glu His His Gin Asn His Asp Ser Gly Pro Leu Ala Gly 
290 295 300 

Glu Asn Val Met Asn Val Val Val Val Ala Ala Glu Cys Ser Pro Trp 
305 310 315 320 

Cys Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala Leu Pro Lys Ala 
325 330 335 

Leu Ala Lys Arg Gly His Arg Val Met Val Val Val Pro Arg Tyr Gly 
340 345 ■ 350 

Asp Tyr Glu Glu Ala Tyr Asp Val Gly Val Arg Lys Tyr Tyr Lys Ala 
355 360 365 



Ala Gly Gin Asp Met Glu Val Asn Tyr Phe His Ala Tyr He Asp Gly 
370 375 380 
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Val Asp Phe Val Phe lie Asp Ala Pro Leu Phe Arg His Arg Gin Glu 
385 390 395 400 

Asp lie Tyr Gly Gly Ser Arg Gin Glu He Met Lys Arg Met He Leu 
405 410 415 

Phe Cys Lys Ala Ala Val Glu Val Pro Trp His Val Pro Cys Gly Gly 
420 425 430 

Val Pro Tyr Gly Asp Gly Asn Leu Val Phe He Ala Asn Asp Trp His 
435 440 445 

Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr Arg Asp His Gly 
450 455 460 

Leu Met Gin Tyr Thr Arg Ser He Met Val He His Asn He Ala His 
465 470 475 480 

Gin Gly Arg Gly Pro Val Asp Glu Phe Pro Phe Thr Glu Leu Pro Glu 
• 485 490 495 

His Tyr Leu Glu His Phe Arg Leu Tyr Asp Pro Val Gly Gly Glu His 
500 505 .510 

Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met Ala Asp. Gin Val Val Val 
515 520 525 

Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys Thr Val Glu Gly Gly Trp 
530 " 535 540 

Gly Leu His Asp He He Arg Gin Asn Asp Trp Lys Thr Arg Gly lie 
545 550 555 560 

Val Asn Gly He Asp Asn Met Glu Trp Asn Pro Glu Val Asp Val His. 

565 570 575 

Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser Leu Gly Thr Leu Asp Ser 
580 * 585 590 

Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin Arg Glu Leu Gly Leu Gin 
595 600 605 

Val Arg Ala Asp Val Pro Leu Leu Gly Phe He Gly Arg Leu Asp Gly 
610 615 620 

Gin Lys Gly Val Glu He He Ala Asp Ala Met Pro Trp. He Val Ser 
625 630 635 640 

Gin Asp Val Gin Leu Val Met Leu Gly Thr Gly Arg His Asp Leu Glu 
64 5 650 655 

Ser Met Leu Arg His Phe Glu Arg Glu His His Asp Lys Val Arg Gly 
660 665 670 

Trp Val Gly Phe Ser Val Arg Leu Ala His Arg He Thr Ala Gly Ala 
675 680 685 

Asp Ala Leu Leu Met Pro Ser Arg Phe Glu Pro Cys Gly Leu Asn Gin 
690 695 700 

Leu Tyr Ala Met Ala Tyr Gly Thr Val. Pro Val Val His Ala Val Gly 
705 710 715 720 

Gly Val Arg Asp Thr Val Pro Pro Phe Asp Pro Phe Asn His Ser Gly 



WO 00/66745 



PCT/AU00/00385 



13 



725 730 735 

Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala His Lys Leu lie Glu Ala 
740 745 750 

Leu Gly His Cys Leu Arg Thr Tyr Arg Asp Tyr Lys Glu Ser Trp Arg 
755 760 765 

Gly Leu Gin Glu Arg Gly Met Ser Gin Asp Phe Ser Trp Glu His Ala 
770 775 780 

Ala Lys Leu Tyr Glu Asp Val Leu Leu Lys Ala Lys Tyr Gin Trp 
785 790 795 



<210> 5 
<211> 2107 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) . . (1791) 

<400> 5 

cca get gag aag acg ccg ccg teg tec ggc tea aat ttc gag tec teg 

Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asn Phe Glu Ser Ser 
1 - 5 10 15 

gee tct get ccc ggg tct gac act gtc age gac gtg gaa caa gaa ctg 
Ala Ser Ala Pro Gly Ser Asp Thr Val Ser Asp Val Glu Gin Glu Leu 
20 25 30 



teg ccg cct gca gee ccc get gta caa gaa gac ctt tgg gat ttc aag 
Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp Phe Lys 
50 55 60 



48 



96 



aag aag ggt gcg gtc gtt gtc gaa gaa get cca aag cca aag get ctt 144 
Lys Lys Gly Ala Val Val Val Glu Glu Ala Pro Lys Pro Lys Ala Leu 
35 40 . 45 



192 



aaa tac att ggt ttc gag gag ccc gtg gag gee aag gat gat ggc egg 240 
Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp Gly Arg 
65 70 75 80 

get gtc gca gat gat gcg ggc tec ttt gaa cac cac cag aat cac gac 288 
Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn His Asp 
85 90 95 

tec gga cct ttg gca ggg gag aat gtc atg aac gtg gtc gtc gtg get 336 
Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val Val Ala 
100 105 HO 

get gag tgt tct ccc tgg tgc aaa aca ggt ggt ctg gga gat gtt gcg 384 
Ala Glu Cys Ser Pro Trp Cys Lys Thr Gly Gly Leu Gly Asp Val Ala 
115 120 125 

ggt get ctg ccc aag get ttg gca aag aga gga cat cgt gtt atg gtt 432 
Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val Met Val 
130 "* 135 140 

gtg gta cca agg tat ggg gac tat gaa gaa cct acg gat gtc gga gtc 480 
Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Pro Thr Asp Val Gly Val 
145 150 155 160 
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cga aaa tac tac aag get get gga cag gat atg gaa gtg aat tat ttc 
Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn Tyr Phe 
165 170 175 



528 



cat get tat ate gat gga gtt gat ttt gtg ttc att gac get cct etc 576 
His Ala Tyr lie Asp Gly Val Asp Phe Val Phe lie Asp Ala Pro Leu 
180 185 190 

ttc cga cac cga gag gaa gac att tat ggg ggc age aga cag gaa att 624 
Phe Arg His Arg Glu Glu Asp lie Tyr Gly Gly Ser Arg Gin Glu lie 
195 200 205 

atg aag cgc atg att ttg ttc tgc aag gec get gtt gag gtt cca tgg 672 
Met Lys Arg Met He Leu Phe Cys Lys Ala Ala Val Glu Val Pro Trp 
210 215 220 

cac gtt cca tgc ggc ggt gtc cct tat ggg gat gga aat ctg gtg ttt 720 
His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu Val Phe 
225 230 235 240 

att gca aat gat tgg cac acg gca etc ctg cct gtc tat ctg aaa gca 768 
He Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala 
245 250 255 

tat tac agg gac cat ggt ttg atg cag tac act egg tec att atg gtg 816 
Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser He Met Val 
260 265 270 

ata cat aac ate get cac cag ggc cgt ggc cct gta gat gaa ttc ccg .864 
He His Asn He Ala His Gin Gly Arg Gly Pro Val Asp Glu Phe Pro 
275 280 285 

ttc ace gag ttg cct gag cac tac ctg gaa cac ttc aga ctg tac gac 912 
Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu Tyr Asp 
290 295 300 

ccc gtg ggt ggt gaa cac gec aac tac ttc gec gee ggc ctg aag atg 960 
Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met 
305 310 315 320 

gcg gac cag gtt gtc gtg gtg age ccc ggg tac ctg tgg gag ctg aag 1008 
Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys 
325 330 335 

acg gtg gag ggc ggc tgg ggg ctt cac gac ate ata egg cag aac gac 1056 
Thr Val Glu Gly Gly Trp Gly Leu His Asp He He Arg Gin Asn Asp 
340 345 350 

tgg aag acc cgc ggc ate gtc aac ggc ate gac aac atg gag tgg aac 1104 
Trp Lys Thr Arg Gly He Val Asn Gly He Asp Asn Met Glu Trp Asn 
355 360 365 

ccc gag gtg gac. gec cac etc aag teg gac ggc tac acc aac ttc tec 1152 
Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 
370 375 380 

ctg agg acg ctg gac tec ggc aag egg cag tgc aag gag gee ctg cag 1200 
Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385. 390 . 395 400 

cgc gag ctg ggc ctg cag gtc cgc gee gac gtg ccg ctg etc ggc ttc 1248 
Arg Glu Leu Gly Leu Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 
405 410 415 

ate ggc cgc ctg gac ggg cag aag ggc gtg gag ate ate gcg gac gee 1296 
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Ile Gly Arg Leu Asp Gly Gin Lys Gly Val Glu lie He Ala Asp Ala 
420 425 430 

atg ccc tgg ate gtg age cag gac gtg cag ctg gtg atg ctg ggc acc 1344 
Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu Gly Thr 
435 440 445 

ggg cgc cac gac ctg gag age atg ctg cag cac ttc gag egg gag cac 1392 
Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
450 .455 460 

cac gac aag gtg cgc ggg tgg gtg ggg ttc tec gtg cgc ctg gcg cac 1440 
His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 
465 470 475 480 

egg ate acg gcg ggg gcg gac gcg etc etc atg ccc tec egg ttc gtg 14 88 
Arg He Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 
485 4 90 4 95 

ccg tgc ggg ctg aac cag etc tac gee atg gee tac ggc acc gtc ccc 1536 
Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 
500 505 510 

gtc gtg cac gee gtc ggc ggc etc agg gac acc gtg ccg ccg ttc gac 1584 
Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

ccc ttc aac cac tec ggg etc ggg tgg acg ttc gac cgc gec gag gcg 1632 
Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 
530 535 540 

cac aag ctg ate gag gcg etc ggg cac tgc etc cgc acc tac cga gac 1680 
His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg Thr Tyr Arg Asp 
545 550 555 560 

ttc aag gag age tgg agg gee etc cag gag cgc ggc atg teg cag gac 1728 
Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 
565 570 575 

ttc age tgg gag cac gee gee aag etc tac gag gac gtc etc gtc aag 1776 
Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 
580 585 590 

gee aag tac cag tgg tgaaegctag ctgctagccg ctccagcccc gcatgcgtgc 1831 
Ala Lys Tyr Gin Trp 
595 

atgacaggat ggaactgeat tgcgcacgca ggaaagtgcc atggagcgcc ggcatccgcg 1891 
aagtacagtg acatgaggtg tgtgtggttg agaegctgat tccaatccgg cccgtagcag 1951 
agtagagegg aggtatatgg gaatcttaac ttggtattgt aatttgttat gttgtgtgca 2011 
ttattacaat gttgttactt attcttgtta agteggagge caagggegaa agctagctca 2071 
catgtctgat ggatgcaaaa aaaaaaaaaa aaaaaa 2107 



<210> 6 
<211> 597 
<212> PRT 

<213> Triticum aestivum 
<400> 6 

Pro Ala Glu Lys Thr Pro Pro Ser Ser Gly Ser Asn Phe Glu. Ser Ser 



WO 00/66745 



PCT/AU00/00385 



- 16- 



10 



15 



Ala Ser Ala Pro Gly Ser Asp Thr Val Ser Asp Val Glu Gin Glu Leu 
20 25 30 

Lys Lys Gly Ala Val Val Val Glu Glu Ala Pro Lys Pro Lys Ala Leu 
35 40 45 

Ser Pro Pro Ala Ala Pro Ala Val Gin Glu Asp Leu Trp Asp Phe Lys 
50 55 60 

Lys Tyr He Gly Phe Glu Glu Pro Val Glu Ala Lys Asp Asp Gly Arg 
65 70 75 80 

Ala Val Ala Asp Asp Ala Gly Ser Phe Glu His His Gin Asn His Asp 
85 90 95 

Ser Gly Pro Leu Ala Gly Glu Asn Val Met Asn Val Val Val Val Ala 
100 105 HO 

Ala Glu Cys Ser Pro Trp Cys Lys Thr. Gly Gly Leu Gly Asp Val Ala 
. 115 120 125 

Gly Ala Leu Pro Lys Ala Leu Ala Lys Arg Gly His Arg Val Met Val 
130 135 140 

Val Val Pro Arg Tyr Gly Asp Tyr Glu Glu Pro Thr Asp Val Gly Val 
145 150 155 160 

Arg Lys Tyr Tyr Lys Ala Ala Gly Gin Asp Met Glu Val Asn Tyr Phe 
165 170 175 

His Ala Tyr He Asp Gly Val Asp Phe Val Phe He Asp Ala Pro Leu 
180 185 190 

Phe Arg His Arg Glu Glu Asp He Tyr Gly Gly Ser Arg Gin Glu He 
195 200 205 

Met Lys Arg Met He Leu Phe Cys Lys Ala Ala Val Glu Val Pro Trp 
210 215 220 

His Val Pro Cys Gly Gly Val Pro Tyr Gly Asp Gly Asn Leu Val Phe 
225 230 235 240 

He Ala Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala 
245 250 255 

Tyr Tyr Arg Asp His Gly Leu Met Gin Tyr Thr Arg Ser He Met Val 
260 " 265 270 

He His Asn He Ala His Gin Gly Arg Gly Pro Val Asp Glu Phe Pro 
275 280 285 

Phe Thr Glu Leu Pro Glu His Tyr Leu Glu His Phe Arg Leu Tyr Asp 
290 295 300 

Pro Val Gly Gly Glu His Ala Asn Tyr Phe Ala Ala Gly Leu Lys Met 
305 310 315 320 

Ala Asp Gin Val Val Val Val Ser Pro Gly Tyr Leu Trp Glu Leu Lys 
325 330 335 



Thr Val Glu Gly Gly Trp Gly Leu His Asp He He Arg Gin Asn Asp 
340 345 350 
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Trp Lys Thr Arg Gly lie Val Asn Gly lie Asp Asn Met Glu Trp Asn 
355 360 365 

Pro Glu Val Asp Ala His Leu Lys Ser Asp Gly Tyr Thr Asn Phe Ser 
370 375 380 

Leu Arg Thr Leu Asp Ser Gly Lys Arg Gin Cys Lys Glu Ala Leu Gin 
385 390 395 400 

Arg Glu Leu Gly Leu. Gin Val Arg Ala Asp Val Pro Leu Leu Gly Phe 
405 410 415 

lie Gly Arg Leu Asp Gly Gin Lys Gly Val Glu He He Ala Asp Ala 
420 425 430 

Met Pro Trp He Val Ser Gin Asp Val Gin Leu Val Met Leu Gly Thr 
435 440 445 

Gly Arg His Asp Leu Glu Ser Met Leu Gin His Phe Glu Arg Glu His 
450 455 460 

His Asp Lys Val Arg Gly Trp Val Gly Phe Ser Val Arg Leu Ala His 
465 470 475 480 

Arg lie Thr Ala Gly Ala Asp Ala Leu Leu Met Pro Ser Arg Phe Val 
485 490 495 

Pro Cys Gly Leu Asn Gin Leu Tyr Ala Met Ala Tyr Gly Thr Val Pro 
500 505 510 

Val Val His Ala Val Gly Gly Leu Arg Asp Thr Val Pro Pro Phe Asp 
515 520 525 

Pro Phe Asn His Ser Gly Leu Gly Trp Thr Phe Asp Arg Ala Glu Ala 
530 535 540 

His Lys Leu He Glu Ala Leu Gly His Cys Leu Arg Thr Tyr Arg Asp 
545 ' 550 555 560 

Phe Lys Glu Ser Trp Arg Ala Leu Gin Glu Arg Gly Met Ser Gin Asp 
565 570 575 

Phe Ser Trp Glu His Ala Ala Lys Leu Tyr Glu Asp Val Leu Val Lys 
580 585 .590 

Ala Lys Tyr Gin Trp 
595 



<210> 7 
<211> 5346 
<212> DNA 

<213> Triticum aestivum 



<220> 

<221> CDS 

<222> (29) . . (4912) 

<400> 7 

cggcacgagg tttagtaggt tccgggaa atg gag atg tct etc tgg cca egg 52 

Met Glu Met Ser Leu Trp Pro Arg 

1 . 5 

age ccc ctg tgc cct egg age agg cag ccg etc gtc gtc gtc egg ccg 100 
Ser Pro Leu Cys Pro Arg Ser Arg Gin Pro Leu Val Val Val Arg Pro 
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10 15 20 

gcc ggc cgc ggc ggc etc acg cag cct ttt ttg atg aat ggc aga ttt 

Ala Gly Arg Gly Gly Leu Thr Gin Pro Phe Leu Met Asn Gly Arg Phe 

25 30 35 40 

act cga age agg acc ctt cga tgc atg gta gca agt tea gat cct cct. 196 

Thr Arg Ser Arg Thr Leu Arg Cys Met Val Ala Ser Ser Asp Pro Pro 

45 50 55 



aat agg aaa tea aga agg atg gta cca cct cag gtt aaa gtc att tct 
Asn Arg Lys Ser Arg Arg Met Val Pro Pro Gin Val Lys Val lie Ser 
60 65 70 

tct aga gga tat acg aca aga etc att gtt gaa cca age aac gag aat 
Ser Arg Gly Tyr Thr Thr Arg Leu lie Val Glu Pro Ser Asn Glu Asn 
75 80 85 

aca gaa cac aat aat egg gat gaa gaa act ctt gat aca tac aat gcg 
Thr Glu His Asn Asn Arg Asp Glu Glu Thr Leu Asp Thr Tyr Asn Ala 
90 95 100 

eta tta agt acc gag aca gca gaa tgg aca gat aat aga gaa gcc gag 
Leu Leu Ser Thr Glu Thr Ala Glu Trp Thr Asp Asn Arg Glu Ala Glu 
105 HO 115 120 

act get aaa gcg gac teg teg caa aat get tta age agt tct ata att 
Thr Ala Lys Ala Asp Ser Ser Gin Asn Ala Leu Ser Ser Ser He He 
125 130 135 

ggg gaa gtg gat gtg gcg gat gaa gat ata ctt gcg get gat ctg aca 
Gly Glu Val Asp Val Ala Asp Glu Asp He Leu Ala Ala Asp Leu Thr 
140 145 150 

gtg tat tea ttg age agt gta atg aag aag gaa gtg gat gca gcg gac 
Val Tyr Ser Leu Ser Ser Val Met Lys Lys Glu Val Asp Ala Ala Asp 
155 160 165 

aaa get aga gtt aaa gaa gac gca ttt gag ctg gat ttg cca gca act 
Lys Ala Arg Val Lys Glu Asp Ala Phe Glu Leu Asp Leu Pro Ala Thr 
170 175 180 

aca ttg aga agt gtg ata gta gat gtg atg gat cat aat ggg act gta 
Thr Leu Arg Ser Val He Val Asp Val Met Asp His Asn Gly Thr Val 
185 ' 190 195 200 

caa gag aca ttg aga agt gtg ata gta gat gtg atg gat cat aat ggg 
Gin Glu Thr Le'i Arg Ser Val He Val Asp Val Met Asp His Asn Gly 
205 210 215 



tea gga aat att tea age agt gcg acg acc gtg gaa eta gat gcg gtt 

Ser Gly Asn He Ser Ser Ser Ala Thr Thr Val Glu Leu Asp Ala Val 
250 255 260 

gac gaa gtc ggg cct gtt caa gac aaa ttt gag gcg acc tea tea gga 

Asp Glu Val Gly Pro Val Gin Asp Lys Phe Glu Ala Thr Ser Ser Gly 

265 270 275 280 



244 



292 



340 



388 



436 



484 



532 



580 



628 



676 



act gta caa gag aca ttg aga agt gtg ata gta gat gtg atg gat gat 724 
Thr Val Gin Glu Thr Leu Arg Ser Val He Val Asp Val Met Asp Asp 
220 225 230 

gcg gcg gac aaa get aga gtt gaa gaa gac gta ttt gag ctg gat ttg 772 
Ala Ala Asp Lys Ala Arg Val Glu Glu Asp Val Phe Glu Leu Asp Leu 
235 240 245 



820 



868 
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aat gtt tea aac agt gca acg gta egg gaa gtg gat gca agt gat gaa 

Asn Val Ser Asn Ser Ala Thr Val Arg Glu Val Asp Ala Ser Asp Glu 

285 290 295 

get ggg aat gat caa ggc ata ttt aga gca gat ttg tea gga aat gtt 

Ala Gly Asn Asp Gin Gly He Phe Arg Ala Asp Leu Ser Gly Asn Val 

300 305 310 



aga gaa gtg gat gat gtg gtg gat gaa act aga tea gaa gag gaa aca 
Arg Glu Val Asp Asp Val Val Asp Glu Thr Arg Ser Glu Glu Glu Thr 
380 385 390 

ttt gca atg gat ttg ttt gca agt gaa tea ggc cat gag aaa cat atg 
Phe Ala Met Asp Leu Phe Ala Ser Glu Ser Gly His Glu Lys His Met 
395 400 405 

gca gtg gat tat gtg ggt gaa get acc gat gaa gaa gag act tac caa 
Ala Val Asp Tyr Val Gly Glu Ala Thr Asp Glu Glu Glu Thr Tyr Gin 
410 415 420 

cag caa tat cca gta ccg tct tea ttc tct atg tgg gac aag get att 
Gin Gin Tyr Pro Val Pro Ser Ser Phe Ser Met Trp Asp Lys Ala He 
425 430 435 440 



gat gat tta cca gga caa aac caa teg ate att ggt tec tat aaa caa 
Asp Asp Leu Pro Gly Gin Asn Gin Ser He He Gly Ser Tyr Lys Gin 
475 480 485 



tct agt aaa caa cac egg tea att gtt get ttc ccc aaa caa aac cag 
Ser Ser Lys Gin His Arg Ser He Val Ala Phe Pro Lys Gin Asn Gin 
505 510 515 520 



916 



964 



ttt tea age agt aca aca gtg gaa gtg ggt gca gtg gat gaa get ggg 1012 

Phe Ser Ser Ser Thr Thr Val Glu Val Gly Ala Val Asp Glu Ala Gly 

. . 315 320 325 

tct ata aag gac agg ttt gag acg gat teg tea gga aat gtt tea aca 1060 

Ser He Lys Asp Arg Phe Glu Thr Asp Ser Ser Gly Asn Val Ser Thr 
330 335 340 

agt gcg ccg atg tgg gat gca att gat gaa acc gtg get gat caa gac 

Ser Ala Pro Met Trp Asp Ala He Asp Glu Thr Val Ala Asp Gin Asp 

345 350 355 360 

aca ttt gag gcg gat ttg teg gga aat get tea age tgc gca aca tac 1156 

Thr Phe Glu Ala Asp Leu Ser Gly Asn Ala Ser Ser Cys Ala Thr Tyr 
365 370 375 



1108 



1204 



1252 



1300 



1348 



get aaa aca ggt gta agt ttg aat cct gag ctg cga ctt gtc agg gtt 1396 
Ala Lys Thr Gly Val Ser Leu Asn Pro Glu Leu Arg Leu Val Arg Val 
445 450 455 

gaa gaa caa ggc aaa gta aat ttt agt gat aaa aaa gac ctg tea att 1444 
Glu Glu Gin Gly Lys Val Asn Phe Ser Asp Lys Lys Asp Leu Ser He 
460 465 470 



1492 



gat aaa tea att get gat gtt gcg gga ccg acc caa tea att ttt ggt 1540 
Asp Lys Ser He Ala Asp Val Ala Gly Pro Thr Gin Ser He Phe Gly 
490 495 500 



1588 



tea att gtt agt gtc act gag caa aag. cag tec ata gtt gga ttc cgt 1636 
Ser He Val Ser Val Thr Glu Gin Lys Gin Ser lie Val Gly Phe Arg 
525 530 535 
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agt caa gat ctt teg get gtt agt etc cct aaa caa aac gta cca att 
Ser Gin Asp Leu Ser Ala Val Ser Leu Pro Lys Gin Asn Val Pro lie 
540 545 550 



1684 



gtt ggt acg teg aga gag ggt caa aca aag caa gtt cct gtt gtt gat 1732 

Val Gly Thr Ser Arg Glu Gly Gin Thr Lys Gin Val Pro Val Val Asp 

555 560 565 

aga cag gat gca ttg tat gtg aat gga ctg gaa get aag gag gga gat 

Arg Gin Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp 
570 575 580 



gtt gac aat gtg ttg egg aag cat cag gca gat aga ace caa gca gtg 
Val Asp Asn val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val 
605 610 615 

gaa aag aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg act 
Glu Lys Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr 
620 625 630 

gaa cat cag aaa cgt get gee gaa gga cag atg gta gtt aac gag gat 
Glu His Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp 
635 640 645 

gag ctt tct ata act gaa att gga atg ggg aga ggt gat aaa att cag 
Glu Leu Ser He Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin 
650 655 660 

cat gtg ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg cag 
His Val Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin 
665 670 675 680 



ccg caa gca eta aag gtg atg ctg caa gaa etc get gag aaa aat tat 
Pro Gin Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr 
715 720 725 

teg atg agg aac aag ctg ttt gtt ttt cca cag gta gtg aaa get gat 
Ser Met Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp 
730 735 740 

tea gtt att gat ctt tat tta aat cgt gac eta aca get ttg gcg aat 
Ser Val He Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn 
745 750 755 760 



tct tgc aaa ctg tac ata ccc aag gag gec tac aga tta gac ttt gtg 



1780 



cac aca tec gag aaa act gat gag gat gcg ctt cat gta aag ttt aat 1828 
His Thr Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn 
585 590 595 600 



1876 



1924 



1972 



2020 



2068 



tta att gag gat gat gga caa tat gaa gtt gac gag ace tct gtg tec 2116 
Leu He Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser 
685 690 695 

gtt aac gtt gaa caa gat ate cag ggg tea cca cag gat gtt gtg gat 2164 
Val Asn Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp 
700 705 710 



2212 



2260 



2308 



gaa ccc gat gtc gtc ate aaa gga gca ttc aat ggt tgg aaa tgg agg 2356 
Glu Pro Asp Val Val lie Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg 
765 770 775 

ctt ttc act gaa aga ttg cac aag agt gac ctt gga ggg gtt tgg tgg 2404 
Leu Phe Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp 
780 785 790 



2452 
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Ser Cys Lys Leu Tyr He Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val 

795 800 805 

ttc ttc aac ggt cgc acg gtc tat gag aac aat ggc aac aat gat ttc 2500 

Phe Phe Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe 
810 815 820 

tgt ata gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat ttc 2548 

Cys He Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe 
825 830 835 840 



ttg gtt aaa gaa aag caa agg gag ctt gag aaa ctt gcc atg gaa gaa 
Leu Val Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu 
845 850 855 

get gaa agg agg aca cag act gaa gaa cag egg cga aga aag gaa gca 
Ala Glu Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala 
860 865 870 



ate aag aag aaa aaa ttg caa agt atg ttg agt ttg gcc aga aca tgt 
He Lys Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys 
890 895 900 



act gag att tgg atg cat ggt ggt tac aac aat tgg aca gat gga etc 
Thr Glu He Trp Met His Gly Gly Tyr Asn Asn Trp Thr Asp Gly Leu 
940 945 950 



2596 



2644 



agg get gca gat gaa get gtc agg gca caa gcg aag gcc gag ata gag 2692 
Arg Ala Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu 
875 880 885 



2740 



gtt gat aat ttg tgg tac ata gag get age aca gat aca aga gga gat 2788 
Val Asp Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp 
905 910 915 920 

act ate agg tta tat tat aac aga aac teg agg cca ctt gcg cat agt 2836 
Thr He Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser 
925 930 935 



2884 



tct att gtt gaa age ttt gtc aag tgc aat gac aaa gac ggc gat tgg 2932 
Ser He Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp 
955 960 965 

tgg tat gca gat gtt att cca cct gaa aag gca ctt gtg ttg gac tgg 2980 
Trp Tyr Ala Asp Val He Pro Pro Glu Lys Ala Leu Val Leu Asp Trp 
970 975 980 

gtt ttt get gat ggg cca get ggg aat gca agg aac tat gac aac aat 3028 
Val Phe Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn 
985 990 995 1000 

get cga caa gat ttc cat get att ctt ccg aac aac aat gta ace gag 3076 
Ala Arg Gin Asp Phe His Ala. He Leu Pro Asn Asn Asn Val Thr Glu 
1005 1010 1015 

gaa ggc ttc tgg gcg caa gag gag caa aac ate tat aca agg ctt ctg 3124 
Glu Gly Phe Trp Ala Gin Glu Glu Gin Asn He Tyr Thr Arg Leu Leu 
1020 1025 1030 

caa gaa agg aga gaa aag gaa gaa ace atg aaa aga aag get gag aga 3172 
Gin Glu Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg 
1035 1040 1045 

agt gca aat ate aaa get gag atg aag gca aaa act atg cga agg ttt 3220 
Ser Ala Asn He Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe 



WO 00/66745 



PCT/AUOO/00385 



-22- 



1050 1055 1060 

ctg ctt tec cag aaa cac att gtt tat acc gaa ccg ctt gaa ata cgt 3268 
Leu Leu Ser Gin Lys His He Val Tyr Thr Glu Pro Leu Glu He Arg 
1065 1070 1075 1080 

gec gga acc aca gtg gat gtg eta tac aat ccc tct aac aca gtg eta 3316 
Ala Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu 
1085 1090 1095 

aat gga aag teg gag ggt tgg ttt aga tgc tec ttt aac ctt tgg atg 3364 
Asn Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met 
1100 1105 1110 

cat tea agt ggg gca ttg cca ccc cag aag atg gtg aaa tea ggg gat 3412 
His Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp 
1115 1120 1125 

ggg ccg etc tta aaa gca aca gtt gat gtt cca ccg gat gee tat atg 34 60 
Gly Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met 
1130 1135 1140 

atg gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat gac aac 3508 
Met Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn 
1145 1150 1155 H60 

agg aat ggg atg gac tat cat att cct gtt tct gat tea att gaa aca 3556 
Arg Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr 
1165 1170 1175 

gag aat tac atg cgt att ate cac att gee gtt gag atg gee ccc gtt 3604 
Glu Asn Tyr Met Arg He lie His He Ala Val Glu Met Ala Pro Val 
1180 1185 1190 

gca aag gtt gga ggt ctt ggg gat gtt gtt aca agt ctt tea cgt gee 3652 
Ala Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala 
1195 1200 1205 

att caa gat eta gga cat act gtc gag gtt att etc ccg aag tac gac 3700 
He Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp 
1210 1215 1220 

tgt ttg aac caa age agt gtc aag gat tta cat tta tat -caa agt ttt 3748 
Cys Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe 
1225 1230 1235 1240 

tct tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gtc gaa gac 3796 
Ser Trp Gly Glv Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp 
1245 1250 1255 

ctg acc gtt tac ttc ctg gaa cct caa aat ggg atg ttt ggc gtt gga 3844 
Leu Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly 
1260 1265 1270 

tgt gta tat gga agg aat gat gac cgc aga ttt ggg ttc ttc tgt cat 3892 
Cys Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His 
1275 1280 1285 

tct get eta gag ttt ate etc cag aat. gaa ttt tct cca cat ata ata 3940 
Ser Ala Leu Glu Phe He Leu Gin Asn Glu Phe Ser Pro His lie lie 
1290 1295 1300 

cat tgc cat gat tgg tea agt get ccg gtc gec tgg eta tat aag gaa 3988 
His Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu 
1305 1310 1315 1320 
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cac tat tec caa tec aga atg gca age act egg gtt gta ttt ace ate 4036 
His Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr lie 
1325 1330 1335 

cac aat ctt gaa ttt gga gca cat tat att ggt aaa gca atg aca tac 4084 
His Asn Leu Glu Phe Gly Ala His Tyr He Gly 'Lys Ala Met Thr Tyr 
1340 1345 1350 

tgt gat aaa gee aca act gtt tct cct aca tat tea agg gac gtg gca 4132 
Cys Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala 
1355 1360 1365 

ggc cat ggc gee att get cct cat cgt gag aaa ttc tac ggc att etc 4180 
Gly His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu 
1370 1375 1380 

aat gga att gat cca gat ate tgg gat ccg tac act gac aat ttt ate 4228 
Asn Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He 
1385 1390 1395 1400 

ccg gtc cct tat act tgt gag aat gtt gtc gaa ggc aag aga get gca 4276 
Pro Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala 
1405 1410 1415 

aaa agg gee ttg cag cag aag ttt gga tta cag caa act gat gtc cct 4324 
Lys Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro 
1420 1425 1430 

att gtc gga ate ate ace cgt ctg- aca gee cag aag gga ate cac etc 4372 
He Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly He His Leu 
1435 1440 1445 

ate aag cac gca att cac cga act etc gaa age aac gga cat gtg gtt 4420 
He Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly His Val Val 
1450 1455 1460 

ttg ctt ggt tea get cca gat cat cga ata caa ggc gat ttt tgc aga 44 68 
Leu Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg . 
1465 1470 1475 1480 

ttg gee gat get ctt cat ggt gtt tac cat ggt agg gtg aag ctt gtt 4516 
Leu Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val 
1485 1490 1495 

eta ace tat gat gag cct ctt tct cac ctg ata tac get ggc teg gac 4564 
Leu Thr Tyr Asp Glu Pro Leu Ser His Leu He Tyr Ala Gly Ser Asp 
1500 1505 1510 

ttc ata att gtt cct tea ate ttc gaa ccc tgt ggc tta aca caa ctt 4 612 
Phe He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu 
1515 1520 1525 

gtt gee atg cgt tat gga teg ate cct ata gtt egg aaa act gga gga 4 660 
Val Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly 
1530 1535 1540 

ctt cac gac aca gtc ttc gac gta gac aat gat aag gac egg get egg 4708 
Leu His Asp Thr Val Phe Asp Val Asp Ash Asp Lys Asp Arg Ala Arg 
1545 1550 1555 1560 

tct ctt ggt ctt gaa cca aat ggg ttc agt ttc gac gga gee gac age 4756 
Ser Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser 
1565 1570 1575 
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aat ggc gtg gat tat gcc etc aac aga gca ate ggc get tgg ttc gat 
Asn Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp 
1580 1585 1590 

gcc cgt gat tgg ttc cac tec ctg tgt aag agg gtc atg gag caa gac 
Ala Arg Asp Trp- Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp 
1595 1600 1605 

tgg teg tgg aac egg ccc gca ctg gac tac att gaa ttg tac cat gcc 
Trp Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala 
1610 1615 1620 

get cga aaa ttc tgacacccaa ctgaaccaat gacaagaaca agegcattgt 

Ala Arg Lys Phe 

1625 



4804 



4852 



4900 



4952 



gggatcgact 


agtcatacag 


ggctgtgcag 


ategtcttge ttcagttagt gccctcttca 


5012 


gttagttcca 


agcgcactac 


agtegtacat 


agctgaggat cctcttgcct cctaccaggg 


5072 


ggaacaaagc 


agaaatgeat 


gagtgcattg 


ggaagacttt tatgtatatt gttaaaaaaa 


5132 


tttccttttc 


ttttccttcc 


ctgcacctgg 


aaatggttaa gcgcatcgcc gagataagaa 


5192 


ccgcagtgac 


attctgtgag 


tagctttgta 


tattctctca tcttgtgaaa actaatgttc 


5252 


atgttaggct 


gtctgatcat 


gtggaagctt 


tgttatatgt tacttatggt atacatcaat 


5312 


gatatttaca 


tttgtggaaa 


aaaaaaaaaa 


aaaa 


5346 



<210> 8 
<211> 1628 
<212> PRT 

<213> Triticum aestivum 
<400> 8 

Met Glu Met Ser Leu Trp Pro Arg Ser Pro Leu Cys Pro Arg Ser Arg 
1 5 10 15 

Gin Pro Leu Val Val Val Arg Pro Ala Gly Arg Gly Gly Leu Thr Gin 
20 25 30 

Pro Phe Leu Met Asn Gly Arg Phe Thr Arg Ser Arg Thr Leu Arg Cys 
35 40 45 

Met Val Ala Ser Ser Asp Pro Pro Asn Arg Lys Ser Arg Arg Met Val 
50 55 60 

Pro Pro Gin Val Lys Val He Ser Ser Arg Gly Tyr Thr Thr Arg Leu 
65 70 75 30 

He Val Glu Pro Ser Asn Glu Asn Thr Glu His Asn Asn Arg Asp Glu 
85 90 95 

Glu Thr Leu Asp Thr Tyr Asn Ala Leu Leu Ser Thr Glu Thr Ala Glu 
100 105 110 

Trp Thr Asp Asn Arg Glu Ala Glu Thr Ala Lys Ala Asp Ser Ser Gin 
115 120 125 

Asn Ala Leu Ser Ser Ser He He Gly Glu Val Asp Val Ala Asp Glu 
130 135 140 

Asp He Leu Ala Ala Asp Leu Thr Val Tyr Ser Leu Ser Ser Val Met 
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145 



150 



155 



160 



Lys Lys Glu Val Asp Ala Ala Asp Lys Ala Arg Val Lys Glu Asp Ala 
165 170 175 

Phe Glu Leu Asp Leu Pro Ala Thr Thr Leu Arg Ser Val He Val Asp 
180 185 190 

Val Met Asp His Asn Gly Thr Val Gin Glu Thr Leu Arg Ser Val He 
195 200 205 

Val Asp Val Met Asp His Asn Gly Thr Val Gin Glu Thr Leu Arg Ser 
210 215 220 

Val He Val Asp Val Met Asp Asp Ala Ala Asp Lys Ala Arg Val Glu 
225 230 235 240 

Glu Asp Val Phe Glu Leu Asp Leu Ser Gly Asn He Ser Ser Ser Ala 
245 250 255 

Thr Thr Val Glu Leu Asp Ala Val Asp Glu Val Gly Pro Val Gin Asp 
260 265 270 

Lys Phe Glu Ala Thr Ser Ser Gly Asn Val Ser Asn Ser Ala Thr Val 
275 280 285 

Arg Glu Val Asp Ala Ser Asp Glu Ala Gly Asn Asp Gin Gly He Phe 
290 295 300 

Arg Ala Asp Leu Ser Gly Asn Val Phe Ser Ser Ser Thr Thr Val Glu 
305 310 315 320 

Val Gly Ala Val Asp Glu Ala Gly Ser He Lys Asp Arg Phe Glu Thr 
325 330 335 

Asp Ser Ser Gly Asn Val Ser Thr Ser Ala Pro Met Trp Asp Ala He 
340 345 350 

Asp Glu Thr Val Ala Asp Gin Asp Thr Phe Glu Ala Asp Leu Ser Gly 
355 360 365 

Asn Ala Ser Ser Cys Ala Thr Tyr Arg Glu Val Asp Asp Val Val Asp 
370 375 380 

Glu Thr Arg Ser Glu Glu Glu Thr Phe Ala Met Asp Leu Phe Ala Ser 
385 * 390 395 400 

Glu Ser Gly His Glu Lys His Met Ala Val Asp Tyr Val Gly Glu Ala 
405 410 415 

Thr Asp Glu Glu Glu Thr Tyr Gin Gin Gin Tyr Pro Val Pro Ser Ser 
420 425 430 

Phe Ser Met Trp Asp Lys Ala He Ala Lys Thr Gly Val Ser Leu Asn 
435 440 445 

Pro Glu Leu Arg Leu Val Arg Val Glu Glu Gin Gly Lys Val Asn Phe 
450 455 460 

Ser Asp Lys Lys Asp Leu Ser He Asp Asp Leu Pro Gly Gin Asn Gin 
465 470 475 480 

Ser lie He Gly Ser Tyr Lys Gin Asp Lys Ser He Ala Asp Val Ala 
485 490 495 
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Gly Pro Thr Gin Ser lie Phe Gly Ser Ser Lys Gin His Arg Ser He 
500 505 510 

Val Ala Phe Pro Lys Gin Asn Gin Ser He Val Ser Val Thr Glu Gin 
515 " 520 525 

Lys Gin Ser He Val Gly Phe Arg Ser Gin Asp Leu Ser Ala Val Ser 
530 535 540 

Leu Pro Lys Gin Asn Val Pro He Val Gly Thr Ser Arg Glu Gly Gin 
545 550 555 560 

Thr Lys Gin Val Pro Val Val Asp Arg Gin Asp Ala Leu Tyr Val Asn 
565 570 575 

Gly Leu Glu Ala Lys Glu Gly Asp His Thr Ser Glu Lys Thr Asp Glu 
580 585 590 

Asp Ala Leu His Val Lys Phe Asn Val Asp Asn Val Leu Arg Lys His 
595 600 605 

Gin Ala Asp Arg Thr Gin Ala Val Glu Lys Lys Thr Trp Lys Lys Val 
610 615 620 

Asp Glu Glu His Leu Tyr Met Thr Glu His Gin Lys Arg Ala Ala Glu 
625 " 630 635 640 

Gly Gin Met Val Val Asn Glu Asp Glu Leu Ser He Thr Glu He Gly 
645 650 655 

Met Gly Arg Gly Asp Lys He Gin His Val Leu Ser Glu Glu Glu Leu 
660 665 670 

Ser Trp Ser Glu Asp Glu Val Gin Leu He Glu Asp Asp Gly Gin Tyr 
675 680 685 

Glu Val Asp Glu Thr Ser Val Ser Val Asn Val Glu Gin Asp He Gin 
690 695 700 

Gly Ser Pro Gin Asp Val Val Asp Pro Gin Ala Leu Lys Val Met Leu 
705 710 715 720 

Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met Arg Asn Lys Leu Phe Val 
725 730 735 

Phe Pro Glu Val Val Lys Ala Asp Ser Val He Asp Leu Tyr Leu Asn 
740 745 750 

Arg Asp Leu rhr Ala Leu Ala Asn Glu Pro Asp Val Val He Lys Gly 
755 760 765 

Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe Thr Glu Arg Leu His Lys 
770 775 780 

Ser Asp Leu Gly Gly. Val Trp Trp Ser Cys Lys Leu Tyr He Pro Lys 
785 790 795 800 

Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe Asn Gly Arg Thr Val Tyr 
805 810 815 

Glu Asn Asn Gly Asn Asn Asp Phe Cys lie Gly He Glu Gly Thr Met 
820 825 830 



Asn Glu Asp Leu Phe Glu Asp Phe Leu Val Lys Glu Lys Gin Arg Glu 
835 840 845 
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Leu Glu Lys Leu Ala Met Glu Glu Ala Glu Arg Arg Thr Gin Thr Glu 
850 855 860 

Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala Ala Asp Glu Ala Val Arg 
665 870 875 880 

Ala Gin Ala Lys Ala Glu lie Glu He Lys Lys Lys Lys Leu Gin Ser 
885 890 895 

Met Leu Ser Leu Ala Arg Thr Cys Val Asp Asn Leu Trp Tyr He Glu 
900 905 910 

Ala Ser Thr Asp Thr Arg Gly Asp Thr He Arg Leu Tyr Tyr Asn Arg 
915 920 925 

Asn Ser. Arg Pro Leu Ala His Ser Thr Glu He Trp Met His Gly Gly 
930 935 940 

Tyr Asn Asn Trp Thr Asp Gly Leu Ser He Val Glu Ser Phe Val Lys 
945. 950 955 960 

Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr Ala Asp Val He Pro Pro 
965 970 975 

Glu Lys Ala Leu Val Leu Asp Trp Val Phe Ala Asp Gly Pro Ala Gly 
980 985 990 

Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg Gin Asp Phe His Ala He 
995 1000 1005 

Leu Pro Asn Asn Asn Val Thr Glu Glu Gly Phe Trp Ala Gin Glu Glu 
1010 1015 1020 

Gin Asn lie Tyr Thr Arg Leu Leu Gin Glu Arg Arg Glu Lys Glu Glu 
025 1030 1035 1040 

Thr Met Lys Arg Lys Ala Glu Arg Ser Ala Asn He Lys Ala Glu Met 
1045 1050 1055 

Lys Ala Lys Thr Met Arg Arg Phe Leu Leu Ser Gin Lys His He Val 
1060 1065 1070 

Tyr Thr Glu Pro Leu Glu He Arg Ala Gly Thr Thr Val Asp Val Leu 
1075 1080 1085 

Tyr Asn Pro Ser Asn Thr Val Leu Asn Gly Lys Ser Glu Gly Trp Phe 
1090 1095 1100 

Arg Cys Ser Phe Asn Leu Trp Met His Ser Ser Gly Ala Leu Pro Pro 
105 1110 1H5 1120 

Gin Lys Met Val Lys Ser Gly Asp Gly Pro Leu Leu Lys Ala Thr Val 
1125 1130 1135 

Asp Val Pro Pro Asp Ala Tyr Met Met Asp Phe Val Phe Ser Glu Trp 
1140 . 1145 1150 

Glu Glu Asp Gly He Tyr Asp Asn Arg Asn Gly Met Asp Tyr His He 
1155 1160 1165 

Pro Val Ser Asp Ser He Glu Thr Glu Asn Tyr Met Arg He He His 
1170 1175 1180 

He Ala Val Glu Met Ala Pro Val Ala Lys Val Gly Gly Leu Gly Asp 
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185 



1190 



1195 



1200 



val Val Thr Ser Leu Ser Arg Ala He Gin Asp Leu Gly His Thr Val 
1205 1210 1215 

Glu Val He Leu Pro Lys Tyr Asp Cys Leu Asn Gin Ser Ser Val Lys 
1220 1225 1230 

Asp Leu His Leu Tyr Gin Ser Phe Ser Trp Gly Gly Thr Glu He Lys 
1235 1240 1245 

Val Trp Val Gly Arg Val Glu Asp Leu Thr Val Tyr Phe Leu Glu Pro 
1250 1255 1260 

Gin Asn Gly Met Phe Gly Val Gly Cys Val Tyr Gly Arg Asn Asp Asp 
265 1270 1275 1280 

Arg Arg Phe Gly Phe Phe Cys His Ser Ala Leu Glu Phe He Leu Gin 
1285 1290 1295 

Asn Glu Phe Ser Pro His He He His Cys His Asp Trp Ser Ser Ala 
1300 1305 1310 

Pro Val Ala Trp Leu Tyr Lys Glu His Tyr Ser Gin Ser Arg Met Ala 
1315 1320 1325 

Ser Thr Arg Val Val Phe Thr lie His Asn Leu Glu Phe Gly Ala His 
1330 1335 1340 

Tyr He Gly Lys Ala Met Thr Tyr Cys Asp Lys Ala Thr Thr Val Ser 
345 ' 1350 1355 1360 

Pro Thr Tyr Ser Arg Asp Val Ala Gly His Gly Ala He Ala Pro His 
1365 1370 1375 

Arg Glu Lys Phe Tyr Gly He Leu Asn Gly He Asp Pro Asp He Trp 
1380 1385 1390 

Asp Pro Tyr Thr Asp Asn Phe lie Pro Val Pro Tyr Thr Cys Glu Asn 
1395 " 1400 1405 

Val Val Glu Gly Lys Arg Ala Ala Lys Arg Ala Leu Gin Gin Lys Phe 
1410 1415 1420 

Gly Leu Gin Gin Thr Asp Val Pro He Val Gly He He Thr Arg Leu 
425 1430 1435 1440 

Thr Ala Gin Lys Gly lie His Leu lie Lys His Ala lie His Arg Thr 
1445 1450 1455 

Leu Glu Ser Asn Gly His Val Val Leu Leu Gly Ser Ala Pro Asp His 
1460 1465 1470 

Arg lie Gin Gly Asp Phe Cys Arg Leu Ala Asp Ala Leu His Gly Val 
1475 1480 1485 

Tyr His Gly Arg Val Lys Leu Val Leu Thr Tyr Asp Glu Pro Leu Ser 
1490 " 1495 1500 

His Leu lie Tyr Ala Gly Ser Asp Phe lie lie Val Pro Ser lie Phe 
505 1510 1515 1520 

Glu Pro Cys Gly Leu Thr Gin Leu Val Ala Met Arg Tyr Gly Ser lie 



1525 



1530 



1535 
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Pro He Val Arg Lys Thr Gly Gly Leu His Asp Thr Val Phe Asp Val 
1540 1545 1550 

Asp Asn Asp Lys Asp Arg Ala Arg Ser Leu Gly Leu Glu Pro Asn Gly 
1555 1560 1565 

Phe Ser Phe Asp Gly Ala Asp Ser Asn Gly Val Asp Tyr Ala Leu Asn 
1570 1575 1580 

Arg Ala He Gly Ala Trp Phe Asp Ala Arg Asp Trp Phe His Ser Leu 
585 1590 1595 1600 

Cys Lys Arg Val Met Glu Gin Asp Trp Ser Trp Asn Arg Pro Ala Leu 
1605 1610 1615 

Asp Tyr He Glu Leu Tyr His Ala Ala Arg Lys Phe 
1620 1625 



<210> 9 
<2U> 3621 
<212> DNA 

<213> Triticum aestivum 

<220> 

<221> CDS 

<222> (1) (3177) 

<400> 9 

gat gca ttg tat gtg aat gga ctg gaa get aag gag gga gat cac aca 

Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 

1 5 10 15 



48 



tec gag aaa act gat gag gat gcg ctt cat gta aag ttt aat gtt gac 96 
Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 
20 25 30 

aat gtg ttg egg aag cat cag gca gat aga acc caa gca gtg gaa aag 144 
Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 " 40 45 

aaa act tgg aag aaa gtt gat gag gaa cat ctt tac atg act gaa cat 192 
Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 

cag aaa cgt get gee gaa gga cag atg gta gtt aac gag gat gag ctt 240 
Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 
65 70 75 80 

tct ata act gaa att gga atg ggg aga ggt gat aaa att cag cat gtg 288 
Ser He Thr Glu He Gly Met Gly Arg Gly Asp Lys He Gin His Val 
85 90 95 

ctt tct gag gaa gag ctt tea tgg tct gaa gat gaa gtg cag tta att 336 
Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu He 
100 105 110 

gag gat gat gga caa tat gaa gtt gac gag acc tct gtg tec gtt aac 384 
Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 

gtt gaa caa gat ate cag ggg tea cca cag gat gtt gtg gat ccg caa 432 
Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 
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gca eta aag gtg atg ctg caa gaa etc get gag aaa aat tat teg atg 

Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 

145 150 155 160 

agg aac aag ctg ttt gtt ttt cca gag gta gtg aaa get gat tea gtt 

Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 

165 170 175 

att gat ctt tat tta aat cgt gac eta aca get ttg gcg aat gaa ccc 

lie Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 

180 185 190 

gat gtc gtc ate aaa gga gca ttc aat ggt tgg aaa tgg agg ctt ttc 

Asp Val Val lie Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 

195 200 205 



480 



528 



57 6 



624 



816 



act gaa aga ttg cac aag agt gac ctt gga ggg gtt tgg tgg tct tgc 672 
Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys 
210 215 220 

aaa ctg tac ata ccc aag gag gee tac aga tta gac ttt gtg ttc ttc 720 
Lys Leu Tyr He Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe 
225 230 235 240 

aac ggt cgc acg gtc tat gag aac aat ggc aac aat gat ttc tgt ata 768 
Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He 
245 < 250 255 

gga ata gaa ggc act atg aat gaa gat ctg ttt gag gat ttc ttg gtt 
Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 
260 265 270 

aaa gaa aag caa agg gag ctt gag aaa ctt gee atg gaa gaa get gaa 864 
Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu 
275 280 285 

agg agg aca cag act gaa gaa cag egg cga aga aag gaa gca agg get 912 
Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295 300 

gca gat gaa get gtc agg gca caa gcg aag gec gag ata gag ate aag 960 
Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu He Lys 
305 310 315 320 

aag aaa aaa ttg caa agt atg ttg agt ttg gee aga aca tgt gtt gat 1008 
Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 
325 330 335 

aat ttg tgg tac ata gag get age aca gat aca aga gga gat act ate 1056 
Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 
340 345 350 

agg tta tat tat aac aga aac teg agg cca ctt gcg cat agt act gag 1104 
Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

att tgg atg cat ggt ggt tac aac aat tgg tea gat gga etc tct att 1152 
He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 

gtt gaa age ttt gtc aag tgc aat gac aaa gac ggc gat tgg tgg tat 1200 
Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
385 390 395 400 

gca gat gtt att cca cct gaa aag gca ctt gtg ttg gac tgg gtt ttt 1248 
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Ala Asp Val lie Pro Pro Glu Lys Ala Leu Val Leu Asp Trp val Phe 

405 410 415 

get gat ggg cca get ggg aat gca agg aac tat gac aac aat get cga 

Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 
420 425 430 

caa gat ttc cat get att ctt ccg aac aac aat gta acc gag gaa ggc 

Gin Asp Phe His Ala He Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

ttc tgg gcg caa gag gag caa aac ate tat aca agg ctt.ctg caa gaa 

Phe Trp Ala Gin Glu Glu Gin Asn He Tyr Thr Arg Leu Leu Gin Glu 

450 455 460 

agg aga gaa aag gaa gaa acc atg aaa aga aag get gag aga agt gca 

Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 

465 470 475 480 

aat ate aaa get gag atg aag gca aaa act atg cga agg ttt ctg ctt 

Asn He Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 

485 490 495 . 

tec cag aaa cac att gtt tat acc cga acc gnc ttg aaa tac gtg ccc 

Ser Gin Lys His He Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 
500 505 510 

gga acc aca gtg gat gtg eta tac aat ccc tct aac aca gtg eta aat 

Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 
515 520 525 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



gga aag teg gag ggt tgg ttt aga tgc tec ttt aac ctt tgg atg cat 1632 
Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 
530 " 535 540 

tea agt ggg gca ttg cca ccc cag aag atg gtg aaa tea ggg gat ggg 1680 
Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 550 555 560 

ccg etc tta aaa gca aca gtt gat gtt cca ccg gat gee tat atg atg 
Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 
565 570 575 

gac ttt gtt ttc tec gag tgg gaa gaa gat ggg ate tat gac aac agg 177 6 
Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg 
580 585 590 



1728 



1872 



1920 



aat ggg atg gac tat cat att cct gtt tct gat tea att gaa aca gag 1824 
Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr Glu 
595 600 605 

aat tac atg cgt att ate cac att gee gtt gag atg gee ccc gtt gca 
Asn Tyr Met Arg . He He His He Ala Val Glu Met Ala Pro Val Ala 
610 " . 615 620 

aag gtt gga ggt ctt ggg gat gtt gtt aca agt ctt tea cgt gec att 
Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 
625 630 635 640 

caa gat eta gga cat act gtc gag gtt att etc ccg aag tac gac tgt 1968 
Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp Cys 
645 650 655 

ttg aac caa age agt gtc aag gat tta cat tta tat caa agt ttt tct 2016 
Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 
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660 • 665 670 

tgg ggt ggt aca gaa ata aaa gta tgg gtt gga cga gtc gaa gac ctg 2064 

Trp Gly Gly Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp Leu 
675 680 685 

acc gtt tac ttc ctg gaa cct caa aat ggg atg ttt ggc gtt gga tgt 2112 

Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 . 

gta tat gga agg aat gat gac cgc aga ttt ggg ttc ttc tgt cat tct 2160 

Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 

705 710 715 720 



get eta gag ttt ate etc cag aat gaa ttt tct cca cat ata ata cat 
Ala Leu Glu Phe lie Leu Gin Asn Glu Phe Ser Pro His He He His 
725 730 735 



aat ctt gaa ttt gga gca cat tat att ggt aaa gca atg aca tac tgt 
Asn Leu Glu Phe Gly Ala His Tyr He Gly Lys Ala Met Thr Tyr Cys 
770 775 780 



2208 



tgc cat gat tgg tea agt get ccg gtc gee tgg eta tat aag gaa cac 2256 

Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 

740 745 . 750 

tat tec caa tec aga atg gca age act egg gtt gta ttt acc ate cac 2304 

Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr He His 
755 760 765 



2352 



2496 



2544 



gat aaa gee aca act gtt tct cct aca tat tea agg gac gtg gca ggc 2400 
Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 
785 790 795 800 

cat ggc gec att get cct cat cgt gag aaa ttc tac ggc att etc aat 2448 
His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn 
805 810 815 

gga att gat cca gat ate tgg gat ccg tac act gac aat ttt ate ccg 
Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro 
820 825 830 

gtc cct tat act tgt gag aat gtt gtc gaa ggc aag agg get gca aaa 
Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 
835 840 845 

agg gee ttg cag cag aag ttt gga tta cag caa act gat gtc cct att 2592 
Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro lie 
850 855 860 

gtc gga ate ate acc cgt ctg aca gca cag aag gga ate cac etc ate 2640 
Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly lie His Leu lie 
865 870 875 880 

aag cac gca att cac. cga acc etc gag age aat gga caa gtg gtt ttg 2688 
Lys His Ala lie His Arg Thr Leu Glu Ser Asn Gly Gin Val Val Leu 
885 890 895 

ctt ggt tea get cca gat cat cga ata caa ggc gat ttt tgc aga ttg 2736 
Leu Gly Ser Ala Pro Asp His Arg lie Gin Gly Asp Phe Cys Arg Leu 
900 905 910 

gee gat get ctt cac ggt gtt tac cat ggt agg gtg aag ctt gtt eta 2784 
Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 .925 



WO 00/66745 



PCT/AU00/00385 



-33- 



acc tac gat gag cct ctt tct cac ctg ata tac get ggc tec gac ttc 
Thr Tyr Asp Glu Pro Leu Ser His Leu lie Tyr Ala Gly Ser Asp Phe 
930 935 940 

att att gtc cct tea ate ttt gaa ccc tgt ggc tta aca caa ctt gtt 
He He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

gec atg cgt tat gga teg ate cct ata gtt egg aaa acc gga gga ctt 
Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 
965 970 975 

tac gac act gtc ttc gac gta gac aat gat aag gac egg get egg tct 
Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 
980 985 990 

ctt ggt ctt gaa cca aat ggg ttc agt ttc gac gga gee gac age aat 
Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

ggc gtg gat tat gee etc aac aga gca ate ggc get tgg ttc gat gee 
Gly Val Asp Tyr Ala Leu Asn Arg Ala He Gly Ala Trp Phe Asp Ala 
1010 1015 1020 



2832 



2880 



2928 



2976 



3024 



3072 



cgt gat tgg ttc cac tec ctg tgt aag agg gtc atg gag caa gac tgg 3120 
Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
1025 1030 1035 1040 

teg tgg aac egg cct gca ctg gac tac att gaa ttg tac cat gee get 3168 
Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 
1045 1050 1055 

cga aaa ttc tgacacccaa ctgaaccaat ggcaagaaca agegcattgt 3217 
Arg Lys Phe 

gggatcgact acagtcatac agggctgtgc agategtett gcttcagtta gtgccctctt 3277 
cagttagttc caagcgcact acagtegtae atagctgagg atcctcttgc ctcctccacc 3337 
aggggaaaca aagcagaaat gcataagtgc attgggaaga cttttatgta tattgttaaa 3397 
tttttccttt tcttttcctt ccctgcacct ggaaatggtt aagegcateg ccgagataag 3457 
aaccacagta acattctgtg agtagctttg tatattctct catcttgtga aaactaatgt 3517 
gcatgttagg ctctctgatc atgtggaagc tttgttatat gttacttatg gttatatggt 3577 
atacatcaat gatatttaca tttgtggaaa aaaaaaa^aa aaaa 3621 



<210> 10 
<211> 1059 
<212> PRT 

<213> Triticura aestivum 
<400> 10 

Asp Ala Leu Tyr Val Asn Gly Leu Glu Ala Lys Glu Gly Asp His Thr 

1. 5 10 15 

Ser Glu Lys Thr Asp Glu Asp Ala Leu His Val Lys Phe Asn Val Asp 
20 25 30 



Asn Val Leu Arg Lys His Gin Ala Asp Arg Thr Gin Ala Val Glu Lys 
35 40 45 
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Lys Thr Trp Lys Lys Val Asp Glu Glu His Leu Tyr Met Thr Glu His 
50 55 60 

Gin Lys Arg Ala Ala Glu Gly Gin Met Val Val Asn Glu Asp Glu Leu 
65 70 75 80 

Ser He Thr Glu He Gly Met Gly Arg Gly Asp Lys lie Gin His Val 
85 90 95 

Leu Ser Glu Glu Glu Leu Ser Trp Ser Glu Asp Glu Val Gin Leu He 
100 105 110 

Glu Asp Asp Gly Gin Tyr Glu Val Asp Glu Thr Ser Val Ser Val Asn 
115 120 125 

Val Glu Gin Asp He Gin Gly Ser Pro Gin Asp Val Val Asp Pro Gin 
130 135 140 

Ala Leu Lys Val Met Leu Gin Glu Leu Ala Glu Lys Asn Tyr Ser Met 
145 150 155 160 

Arg Asn Lys Leu Phe Val Phe Pro Glu Val Val Lys Ala Asp Ser Val 
165 170 175 

He Asp Leu Tyr Leu Asn Arg Asp Leu Thr Ala Leu Ala Asn Glu Pro 
180 185 190 

Asp Val Val He Lys Gly Ala Phe Asn Gly Trp Lys Trp Arg Leu Phe 
195 200 205 

Thr Glu Arg Leu His Lys Ser Asp Leu Gly Gly Val Trp Trp Ser Cys . 
210 215 220 

Lys Leu Tyr He Pro Lys Glu Ala Tyr Arg Leu Asp Phe Val Phe Phe. 
225 230 235 240 

Asn Gly Arg Thr Val Tyr Glu Asn Asn Gly Asn Asn Asp Phe Cys He 
245 250 255 

Gly He Glu Gly Thr Met Asn Glu Asp Leu Phe Glu Asp Phe Leu Val 
260 265 270 

Lys Glu Lys Gin Arg Glu Leu Glu Lys Leu Ala Met Glu Glu Ala Glu 
275 280 285 

Arg Arg Thr Gin Thr Glu Glu Gin Arg Arg Arg Lys Glu Ala Arg Ala 
290 295 300 

Ala Asp Glu Ala Val Arg Ala Gin Ala Lys Ala Glu He Glu lie Lys 
305 310 315 320 

Lys Lys Lys Leu Gin Ser Met Leu Ser Leu Ala Arg Thr Cys Val Asp 
325 330 335 

Asn Leu Trp Tyr He Glu Ala Ser Thr Asp Thr Arg Gly Asp Thr He 
340 345 350 

Arg Leu Tyr Tyr Asn Arg Asn Ser Arg Pro Leu Ala His Ser Thr Glu 
355 360 365 

He Trp Met His Gly Gly Tyr Asn Asn Trp Ser Asp Gly Leu Ser He 
370 375 380 



Val Glu Ser Phe Val Lys Cys Asn Asp Lys Asp Gly Asp Trp Trp Tyr 
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385 



390 



395 



400 



Ala Asp Val He Pro Pro Glu Lys Ala Leu Val Leu Asp Trp Val Phe 
405 410 415 

Ala Asp Gly Pro Ala Gly Asn Ala Arg Asn Tyr Asp Asn Asn Ala Arg 
420 425 430 

Gin Asp Phe. His Ala He Leu Pro Asn Asn Asn Val Thr Glu Glu Gly 
435 440 445 

Phe Trp Ala Gin Glu Glu Gin Asn He Tyr Thr Arg Leu Leu Gin Glu 
450 455 460 

Arg Arg Glu Lys Glu Glu Thr Met Lys Arg Lys Ala Glu Arg Ser Ala 
465 470 475 480 

Asn He Lys Ala Glu Met Lys Ala Lys Thr Met Arg Arg Phe Leu Leu 
485 490 495 

Ser Gin Lys His He Val Tyr Thr Arg Thr Xaa Leu Lys Tyr Val Pro 
500 505 510 

Gly Thr Thr Val Asp Val Leu Tyr Asn Pro Ser Asn Thr Val Leu Asn 
515 520 525 

Gly Lys Ser Glu Gly Trp Phe Arg Cys Ser Phe Asn Leu Trp Met His 
530 535 540 

Ser Ser Gly Ala Leu Pro Pro Gin Lys Met Val Lys Ser Gly Asp Gly 
545 ' 550 555 560 

Pro Leu Leu Lys Ala Thr Val Asp Val Pro Pro Asp Ala Tyr Met Met 
565 570 575 

Asp Phe Val Phe Ser Glu Trp Glu Glu Asp Gly He Tyr Asp Asn Arg 
580 585 590 

Asn Gly Met Asp Tyr His He Pro Val Ser Asp Ser He Glu Thr Glu 
595 ^ 600 605 

Asn Tyr Met Arg He He His He Ala Val Glu Met Ala Pro Val Ala 
610 615 620 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala He 
625 630 635 640 

Gin Asp Leu Gly His Thr Val Glu Val He Leu Pro Lys Tyr Asp Cys 
64 5 650 655 

Leu Asn Gin Ser Ser Val Lys Asp Leu His Leu Tyr Gin Ser Phe Ser 
660 665 670 

Trp Gly Gly Thr Glu He Lys Val Trp Val Gly Arg Val Glu Asp Leu 
675 680 685 

Thr Val Tyr Phe Leu Glu Pro Gin Asn Gly Met Phe Gly Val Gly Cys 
690 695 700 

Val Tyr Gly Arg Asn Asp Asp Arg Arg Phe Gly Phe Phe Cys His Ser 
705 710 715 720 

Ala Leu Glu Phe He Leu Gin Asn Glu Phe Ser Pro His He He His 
725 730 735 
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Cys His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His 
740 745 750 

Tyr Ser Gin Ser Arg Met Ala Ser Thr Arg Val Val Phe Thr He His 
755 760 765 

Asn Leu Glu Phe Gly Ala His Tyr He Gly Lys Ala Met Thr Tyr Cys 
770 775 780 

Asp Lys Ala Thr Thr Val Ser Pro Thr Tyr Ser Arg Asp Val Ala Gly 
785 790 795 800 

His Gly Ala He Ala Pro His Arg Glu Lys Phe Tyr Gly He Leu Asn 
805 810 815 

Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp Asn Phe He Pro 
820 825 830 

Val Pro Tyr Thr Cys Glu Asn Val Val Glu Gly Lys Arg Ala Ala Lys 
835 840 845 

Arg Ala Leu Gin Gin Lys Phe Gly Leu Gin Gin Thr Asp Val Pro He 
850 855 860 

Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly lie His Leu He 
865 870 875 880 

Lys His Ala He His Arg Thr Leu Glu Ser Asn Gly Gin Val Val Leu 
885 890 895 

Leu Gly Ser Ala Pro Asp His Arg He Gin Gly Asp Phe Cys Arg Leu 
900 905 910 

Ala Asp Ala Leu His Gly Val Tyr His Gly Arg Val Lys Leu Val Leu 
915 920 925 

Thr Tyr Asp Glu Pro Leu Ser His Leu lie Tyr Ala Gly Ser Asp Phe 
930 935 940 

lie He Val Pro Ser He Phe Glu Pro Cys Gly Leu Thr Gin Leu Val 
945 950 955 960 

Ala Met Arg Tyr Gly Ser He Pro He Val Arg Lys Thr Gly Gly Leu 
965 970 975 

Tyr Asp Thr Val Phe Asp Val Asp Asn Asp Lys Asp Arg Ala Arg Ser 
980 985 990 

Leu Gly Leu Glu Pro Asn Gly Phe Ser Phe Asp Gly Ala Asp Ser Asn 
995 1000 1005 

Gly Val Asp Tyr Ala Leu Asn Arg Ala lie Gly Ala Trp Phe Asp Ala 
1010 1015 1020 

Arg Asp Trp Phe His Ser Leu Cys Lys Arg Val Met Glu Gin Asp Trp 
1025 1030 1035 1040 

Ser Trp Asn Arg Pro Ala Leu Asp Tyr He Glu Leu Tyr His Ala Ala 
1045 1050 1055 

Arg Lys Phe 



<210> 11 
<211> 728 
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<212> DNA 

<213> Triticum sp. 

<400> 11 



gatcttgaac 


ggcacgtgaa 


agacttgtaa caacatcccc 


gagacctcca 


acctatgaga 


60 


tcatcgatca 


tgacagagca 


tagtattatg gcatagaatg 


aaaaaaaggc 


ataaggtgat 


120 


gagatctcca 


cgccagagcg 


ttgtattcca attttagttc 


tttccccgtg 


aggaggggag 


180 


gctaggcggg 


cgaggcagag 


gggatagggc agtcgccgct 


gcgtggtgga 


ctgactggtg 


240 


tggtgggtgg 


tgggttttgc 


gggcggggtt tagtaggttc 


ccggaaatgg 


agatggctct 


300 


ccggccacgg 


agccctctgt 


gccctcggag cagtcagccg 


ctcgtcgtcg 


tccggccggc 


360 


cggccgcggc 


ggcggcctcg 


cgcaggtacg ggtgattatg 


gttcttgatt 


cggtcggttc 


420 


acggaatgtt 


gtttgatttg 


gttctgtccc gggtcaggtt 


catagtgatt 


ttattccgca 


480 


aaaaaaaaag 


gtttatagtg 


attttgattt ctttcatctc 


gggaacattt 


ttatatctgg 


540 


gagtcaaagg 


gcattggttt 


tgatttgcat gcggaacata 


ttggttattt 


attaatgtgg 


600 


tgagctggaa 


ttcatactgc 


ttaaaacgac gtgattttaa 


ttgctggaag 


aggtaaagaa 


660 


catgaattct 


tgttatattt 


gttaaaaaaa atcccctgtt 


ctagcgtttc 


aatctgcatg 


720 


atcatgga 










728 


<210:> 12 

<211> 2446 

<212> DNA 

<213> Triticum sp. 










<400> 12 
gtgggtctat 


aaaagacagg 


tttgagcgga ttcgtcagga 


aatgtttcaa 


caagtgcgac 


60 


gatgtgggat 


gcaattgatg 


aaaccgtggc ttgatcaaga 


cgcagttgag 


gcggatttgt 


120 


cgggaaatgc 


ttcaagctgc 


gcgacataca gagaagtgga 


tgatgtggtg gatgaaacta 


180 


gatcagaaga 


ggaaacattt 


gcgatggatt tgtttgcaag 


tgaatcaggc 


catgagaaac 


240 


atatggcagt 


ggatcatgtg 


ggtgaagcta ccgatgaaga 


agagacttac 


caacagcaat 


300 


atccagtacc 


gtcttcattc 


tctatgtggg acaaggctat 


tgctaaaaca 


ggtgtaagtt 


36 n 


tgaatcctga 


gctgcgactt 


gtcagggttg aagaacaagg 


caaagtaaat 


tttagtgata 


420 


aaaaagacct 


gtcaattgat 


gatttaccag gacaaaacca 


atcgatcatt 


ggttcctata 


480 


aacaagataa 


atcaattgct 


gatgttgcgg gaccgaccca 


atcaattttt 


ggttctagta 


540 


aacaacaccg 


gtcaattgtt 


gctttcccca aacaaaacca 


gtcaattgtt agtgtcactg 


600 


agcaaaagca 


gtccatagtt 


ggattccgta gtcaagatct 


ttcggctgtt agtctcccta 


660 


aacaaaacgt 


accaattgtt 


ggtacgtcga gagagggtca 


aacaaagcaa 


gttcctgttg 


720 


ttgatagaca 


ggatgcgttg 


tatgtgaatg gactggaagc 


taaggaggga gatcacacat 


780 


ccgagaaaac 


cgatgaggat 


gtgcttcatg taaaatttaa 


tgttgacaat 


gtgttgcgga 


840 
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agcatcaggc agatagaacc caagcagtgg 
aacatcttta catgactgaa catcagatag 
aggatgagct ttctataact gaaattggaa 
tttctgagga agagctttca tggtctgaag 
aatatgaagt tgatgagacc tctgtgtccg 
cacaggatgt tgtggatccg caagcactaa 
attattcgat gaggaacaag ctgtttgttt 
ttgatcttta tttcaatcgt gacctaacag 
aaggagcatt caatggttgg aaatggaggc 
ttggaggggt ttggtggtct tgcaaactgt 
ttgtgttctt caacggtcgc acggtctatg 
gaatagaagg cactatgaat gaagatctgt 
gggagcttga gaaacttgcc atggaagaag 
ggcgaagtaa ggaagcaagg gctgcagatg 
tagagatcaa gaacaaaaaa ttgcagagta 
atttgtggta catagaggct agcacagata 
acagaaactc gaggccactt gcgcatagta 
attggtcaga tggactctct attgttgaaa 
attggtggta tgcagatggt acgacacctc 
attttttttg ttgaggaaac atttgttttg 
atgaatttcc ttgttttatt gatgtcatga 
aagctcaaca tttaccatag acagacgctt 
tgtaatgtaa tacctgtctt ttctctatat 
tgttggactg ggtttttgct gatgggccag 
ctcgacaaga tttccatgct attcttccaa 
tgcaagagga gcaaaacatc tatacaaggc 
ccatgaaaag aaaggtgagt tgcaacaaaa 

<210> 13 

<211> 1032 

<212> DNA 

<213> Triticum sp. 

<400> 13 

gatctctata attttggcag ttaacccctg 
ttttccaaat tcaaaatgca tggttccatg 



- 38 - 

aaacgataac ttggaagaaa gttgatgagg 900 
gtgctgccga aggacagatg gtagttaacg 960 
tggggagagg tgataaaatt cagcatgtgc 1020 
atgaagtgca gttaattgag gatgatggac 1080 
ttaacgttga acaagatatc caggggtcac 1140 
aggtgatgct gcaagaactc gctgagaaaa 1200 
ttccagaggt agtgaaagct gattcagtta 1260 
ctttggcgaa tgaacccgat gttgtcatca 1320 
ttttcactga aagattgcat aagagtgacc 1380 
acatacccaa ggaggcctac agattagact 14 40 
agaacaatgg caacaatgat ttctgtatag 1500 
ttgaggattt cttggttaaa gaaaagcaaa 1560 
ctgaaaggag gacacagact gaagaacagc 1620 
aagctgtcag ggcacaagcg aaggccgaga 1680 
tgttgagttt ggccagaaca tgtgttgata 1740 
caagcggaga tactatcagg ttatactata 1800 
ctgagatttg gatgcatggt ggttacaaca 1860 
gctttgtcaa gtgcaatgac agagacggcg 1920 
aacctttgta cataaggcaa cattgttttg 1980 
attctagcat aatgctccta caaatatggc 2040 
gaaagtattt tattaactcg aaggccatgg 2100 
aaagatcatt tgtattccgt ggatcatata 2160 
gtacagttat tccacctgaa aaagcacttg 2220 
ctgggaatgc aaggaactat gacaacaatg 2280 
acaacaatgt aaccgaggaa ggcttctggg 2340 
ttctgcaaga aaggagagaa aaggaagaaa 24 00 
tctttgcata tagatc 24 46 



agtgatggca aatatattcc ctttcgtcta 60 
caagcttatc caaaatcact tgataatata 120 
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ccaatcacaa cataactttg tttaccataa gaacattcct acttaaaatt tgcaaggtaa 180 

ctccctttcg aggctggttg gcttgatgag taactggcaa ttaacaaaga aaagatatat 240 

ctgatgtttg gaacaaaaca tatgatcagg gttgtttggg ttgactcatg ttccttttta 300 

cctacacagg ctgagagaag tgcaaatatc aaagctgaga tgaaggcaaa aactatgcga 360 

aggtttctgc tttcccagaa acacattgtt tataccgaac cgcttgaaat acgtgccgga 420 

accacagtgg atgtgctata caatccctct aacacagtgc taaatggaaa gccggaggtt 480 

tggtttagat gctcttttaa cctttggatg catccaagtg gagcattgcc accccagaag 540 

atggtgaaat caggggatgg gccgctctta aaagccacag gtttattgcg ttattacatc 600 

actgttatta gtatatatat aaccattttt atgcaatcaa tagagtcaag tgcaactaat 660 

gatgcacaga taggatcaca tcattaggag aatgatgtga tggacaagac ccaatcctaa 720 

gcatagcaca agatcgtgta gttcgttcgc tagagctttt ctaatgtcaa gtatcatttc 780 

cttagaccat gagattgtgc aactcccgga tatcgtagga gtgctttggg tgtatcaaat 840 

gtcacaacgt aactgggtga ctataaaggt gcactacagg tatctccgaa agtttctgtt 900 

gggttggcac gaatcgagac tgggatttgt cactccgtat gacggagagg tatctttggg 960 

cccactcggt aatgcatcat cataatgagc tcaatgtgac taaggagtta gccacgggat 1020 

cgagaattcc eg 1032 

<210> 14 

<211> 892 

<212> DNA 

<213> Triticum sp. 

<400> 14 

aatatttctt gttctattat tggtaataat tagctagttt aatgccataa gcccataaca 60 
gatatgeaac tactccctcc aatccatatt acttgtcgea actttggtac aactttagta 120 
caaagttata ctaaagctgt gacaagtaat atggaccgga gggagtacta tataagcttg 180 
tagctgtttt gagaccgagt gtctgetegg gtggctagct ggageggget gaagtgcttg 240 
caggcacctc ttctcta^aa aaaagtgctt gcagcccccc cgccccctcc atagggtgag 300 
tggtcacctt tcttcttaaa aattatggca ccaagggaaa ttctcggctg gtcgagcttg 360 
tagctatttt ttcggagcgt gaatgggagc gtctttctgt ataaggecta taggcttact 420 
ttgatatata ttgtgaagtc acttaagect tgttaaaacg tagaaactta gttccgcaac 4 80 
ttggccaaat ccctgttaaa ttggtttact gtgtactaga tgcatcgatg gcgcagagtc 540 
ccggggggta ataaagcttc cattttctac aatgaagtta attatcctac ttgccttgta 600 
attactgagt acaatacaga gcaccgaaaa gctgtatcct tcctacttcc ttatgtttat 660 
ctgtgttcct tgtctagtta atgttccacc ggatgectat atgatggact ttgttttctc 720 
cgagtgggaa gaagatggga tctatgacaa caggaatggg atggactatc atattcctgt 780 
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ttctgattca attgaaacag agaattacat gcgtattatc cacattgccg ttgagatggc 840 
ccccgttgca aaggtaatat aattctaagg ctagtttctt tgatgcgagg eg 892 



<210> 15 

<211> 871 

<212> DNA 

<213> Triticum sp. 










<400> 15 
aggttatcct 


ccagaatgaa ttttttccag tacgtattat 


ttagaatact 


ageggtatat 


60 


tgactttttc 


tttgtgagac tacactttct 


tgtttaccat 


tccagtgcac 


catgttcaaa 


120 


atcttgtatt 


cagegegtta ctttcagttt 


ctttactact 


agcttatttg gtgcattggt 


180 


gtttcctttc 


ctactctact atetgaatge 


tacttgtgtt 


ttcgcaacag 


ttgcttcttt 


240 


atccccttcc 


atttctcagt taaaaaaact 


tgcatctgta 


ttcacgtgac 


agcatataat 


300 


acattgecat 


gattggtcaa gtgctccggt 


cgcctggcta 


tataaggaac 


actattccca 


360 


atccagaatg 


gcaagcactc gggttgtatt 


taccatccac 


aatcttgaat 


ttggagcaca 




ttatattggt 


aaagcaatga catactgtga 


taaagecaca 


actgtgagtg 


ccttactgtc 


480 


ttgtaatttt 


taatctttct gtttggcgca 


cagaaaatct 


tccacatttt 


acagaatcat 


540 


gttcttgtgt 


tttgtacgta ttcaactatt 


tccacccaaa 


cttttcaggt 


ttctcctaca 


600 


tattcaaggg 


acgtggcagg ccatggtgcc attgctcctc atcgtgagaa 


attctaegge 


660 


attctcaatg 


gaattgatcc agatatctgg gatcctgatt gccaacatgc tgtttggtcg 


720 


tctcgaggtc 


tttacattgc tggtgctctt taccccgact ttctggcgtg aatgatggag 


780 


taatacgtga 


aaacattaat tcttttctca 


acaagggacg 


gaeaaacgeg 


egagattgee 


840 


tcctacctgg 


etteggaact gaaagaactg g 






871 



<210> 16 

<211> 1592 

<212> DNA 

<213> Triticum sp. 

<400> 16 

egggaattet cgatcccgtg gctaactcct tagtcacatt gagctcatta tgatgatgea 60 
ttaccgagtg ggeccaaaga tacctctccg teataeggag tgacaaatcc cagtctcgat 120 
tcgtgccaac ccaacagaaa ettteggaga tacctgtagt gcacctttat agtcacccag 180 
ttacgttgtg acatttgat.a cacccaaagc actcctacga tatcegggag ttgeacaate 240 
tcatggtcta aggaaatgat acttgacatt agaaaagctc tagegaaega actacacgat 300 
cttgtgctat gcttaggatt gggtcttgtc catcacatca ttctcctaat gatgtgatcc 360 
atacactgac aattttatcc eggtaccaga ttttttccca gagtgcaagt agatatatac 420 
caaggccaca gatagtttta tgcttaacta tgtgtttcat actacttcag gtcccttata 480 
cttgtgagaa tgttgtcgaa ggcaagagag ctgcaaaaag ggccttgcag cagaagtttg 540 
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yatLdcagcd 


aarf na^nf^ 
aoCLyacyLL, 


ttioi. Ly l v-y 


aaatcatcac ccatctaaca acccaaaaaa 


600 


gaa tccacct 


Ca tCadyCaC 




flaarrctcaa aancaacaaa caaottcatc 


660 


atcccttgtg 


aacgaataaa 


Ca LLadaLy L 


tttn^ttat a aaaaottact taetatttat 


720 


ttttgtttac 


ttCaaaaCoa 


aaytCtyaao 


at-naarrffrff tflffhtcctao QtaOttttCfC 
dtyddy Ly LL wyy ttv^toy yk\j^LtkLyv. 


780 


ttggttcagc 


tccagat cat 


eg a a tacaag 


gcgatu t tty L-ay d l Ly y *-l^ ycity 


840 


acggtgttta 


ccacggtagg 


g tgaagcung 


LtCiaoCCtd Lya Lyay u^l tiuLtttaut 


900 


tggtgagctc 


caatatccta 


cacaccatct 


agCCagCCCu tCattdtygy ayLLyyayaL, 


960 


tactttataa 


tttaggttga 


tgatcgatca 


CgCCgcagau ataCyCtgyC tCCyaCttCa 




ttattgtccc 


ttcaatcttc 


gaaccctgtg 


gcttaacaca acttyttyce dtycgtLdty 


1080 


gatcgatccc 


tatagttcgg 


aaaaceggag 


gt.gi.gt.gacL atttctctcc attatycLyc 


114 0 


actgatttgc 


ata tgtcgag 


erg r tggaca 


frtaaahnrraa a t" \a t* f~ 1" t" t" art t" A t" fncaet 
Uydddtyydd aLLdi.LtLLL y y Let LL-y Lay 


1200 


gactttacga 


cactgtcttc 


gaegtagaca 


atgataagga ccgggctcgg tctcttggtc 


1260 


ttgaaccaaa 


tgggttcagt 


ttcgaeggag 


ccgacagcaa cggcgtggat tatgccctca 


1320 


acaggcaagt 


atcgttcctc 


aattagcect 


gaattcagca gtagtgctag gttatttacc 


1380 


ttgcatgttc 


catacctcat 


ttcagagcaa 


teggegcttg gttcgatgee cgtgattggt 


1440 


tccactccct 


gtgtaagagg 


gtcatggaac 


aagactggtc atggaaccgg cccgcactgg 


1500 


actacattga 


attgtaccat 


gccgctcgaa 


aattctgaca cccaactgaa ccaatggcaa 


1560 


gaacaagcgc 


attgtgggat 


cgagaattcc 


eg 


1592 



<210> 17 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 17 

Asp Val Gin Leu Val Met Leu Gly Thr Gly 
1 5 10 

<210> 18 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 18 

Ala Ala Gly Lys Lys Asp Ala Gly lie Asp 
15 10 



<210> 19 
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<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 19 

Ala Thr Gly Lys Lys Asp Ala Gly He Asp 
1 5 10 



<210> 20 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 20 

Ala Leu Gly Lys Lys Asp Ala Gly He Asp 
1 5 10 



<210> 21 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 21 

Ala Thr Gly Lys Lys Asp Ala Leu 
1 5 



<210> 22 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 22 

Ala Leu Gly Lys Lys Asp Ala Leu 
1 5 



<210> 23 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF ; 

i 

<400> 23 | 
Ala Ala Gly Lys Lys Asp Ala Arg Val Asp Asp Asp Ala Ala ; 

1.5 10 : 

{ 
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<210> 24 
<211> 13 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : PEPTIDE MOTIF 
<400> 24 

Ala Leu Gly Lys Lys Asp Ala Gly He Val Asp Gly Ala 
15 10 



<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 25 

tgttgaggtt ccatggcacg ttc 



<210> 26 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 26 

agtcgttctg ccgtatgatg teg 



<210> 27 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 27 

ccaagtacca gtggtgaacg c 



<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 28 

cggtgggatc caacggccc 



<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 29 

ggaggtcttg gtgatgttgt 



<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 30 

cttgaccaat catggcaatg 



<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 31 

cattgccatg attggtcaag 



<210> 32 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 32 

accacctgtc cgttccgttg c 



<210> 33 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 33 

gcacggtcta tgagaacaat ggc 



<210> 34 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 



<400> 34 
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tctgcatacc accaatcgcc g 



<210> 35 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 35 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala Val 
15 10 15 

Gin Asp Leu Gly His Asn Val Glu Val 
20 25 



<210> 36 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE MOTIF 
<400> 36 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser Leu Ser Arg Ala lie 
15 10 15 

Gin Asp Leu Gly His Thr Val Glu Val 

20 25 . 



<210> 37 

<211> 9024 

<212> DNA 

<213> Triticum sp. 

<400> 37 

aaatatgaaa ccaaaaaaaa aatagaaaaa ggaaaggtaa aatagaaagt taaataggaa 60 
taatggataa aaaataaaac atcaaagaaa aacgaaatgc agaagaaaaa aacgtcactt 120 
gttcccttat tatctcccgt gcaccccggt agcgtaggac aaaaagaaaa aatagaacgg 180 
acccaacgtc acaagctcac acatgcccag cgagagaaaa gaaaaatggt gcgacaaaaa 240 
aaaggaaacg ggctgagagc cgaaacacat gggctgcgct ttgttcgcta cgaagctctc 300 
ccctcgacaa aatatgaatc gcgacgtgat tggatcctat ggtggaaaaa gtgaatgaga 360 
ccaaaagaat tctcagctga atgagtttta gcaagactga tcattatatc caacataaat 420 
agattttttt tttgcaaaaa taatccaaat ctattagcaa agttcagtag aagtacaaag 480 
catctcgaac attataaaca ttacactgag attccaggac caccaaacaa cccactactg 540 
ccgcgaaaag aaaaggattc ggaagacaga aattatccaa accacgttcg tccttggttg 600 
ttggtctcat tgcgcgctaa acaacctgga cagcagaaga agcaaagcag tgtgcttccg 660 
ctccgcagca agaagacaag tcgtcacatg tcagacgccg tcactcaagc aagcaaactg 720 
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caatgcttct cgttcggttt atcccctagc acgcacgaac gcatgtgccg caccgcgtca 780 
cgcaacgcat gcatgcacaa accaacaaac gaaacagtgc agttgcagtg ctctatctac 840 
atatacgcaa tcaacgcggg cctcctcctt cgccgcgagc cccgttccgt cctcggtctt 900 
cacgtggatt ttgcaacttc cttccagcag cttgtcacca cggacgcttc ctctctgaca 960 
actggccccg tgggcggaac ggggcctccg ctcgcccctt gcgaaaccca cggctcgtcc 1020 
gttcgcttct ctagcgggca ccgacagaag gggccggcgc agggtaggac caggctgtca 1080 
gctggtgagg agcctgccgc tcgttgtgcc gcagctggag accgagcggg gcaacggaac 1140 
ggctgccgcc ctcgtgtgct gctcgcgtgg cacgccgcaa cggcaccggg cccgctttcc 1200 
agcgtgctcg cccgcaaacc gcagacccaa cacgccagcc gccagggggc cgttcgtacg 1260 
tacccgcccc tcgtgtaaag ccgccgccgt cgtcgccgtc ccccgctcgc ggccatttct 1320 
tcggcctgac cccgttcgtt tacccccaca cagagcacac tccagtccag tccagcccac 1380 
tgccaccgcg ctactctcca ctcccactgc caccacctcc gcctgcgccg cgctctgggc 1440 
ggaccaaccc gcgaaccgta ccatctcccg ccccgatcca tgtcgtcggc ggtcgcgtcc 1500 
gccgcatcct tcctcgcgct cgcgtcagcc tcccccggga gatcacgcag gcgggcgagg 1560 
gtgagcgcgc agccacccca cgccggggcc ggcaggttgc actggccgcc gtggccgccg 1620 
cagcgcacgg ctcgcgacgg agctgtggcg gcgctcgccg ccgggaagaa ggacgcgggg 1680 
atcgacgacg ccgccgcgtc cgtgaggcag ccccgcgcac tccgcggtgg cgccgccacc 1740 
aaggtagtta gttatgacca agttatgacg cgtgcgcgcg cctcgagatc atcgtcgtct 1800 
cgctcacgaa ttgtttattt atacaaaacg cacgcccgcg tgtgcaggtc gcggagcgaa 1860 
gggatcccgt caagacgctc gaccgcgacg ccgcggaagg cggcgggccg tccccgccgg 1920 
cagcgaggca ggacgccgcc cgtccgccga gtatgaacgg catgccggtg aacggcgaga 1980 
acaaatctac cggcggcggc ggcgcgacta aagacagcgg gctgcccacg cccgcacgcg 2040 
cgccccatcc gtcgacccag aacagagcac cggtgaacgg tgaaaacaaa gctaacgtcg 2100 
cctcgccgcc gacaagcata gccgaggccg '.ggcttcgga ttccgcagct accatttcca 2160 
tcagcgacaa ggcgccggag tccgttgtcc cagctgagaa gacgccgccg tcgtccggct 2220 
caaatttcga gtcctcggcc tctgctcccg ggtctgacac tgtcagcgac gtggaacaag 2280 
aactgaagaa gggtgcggtc gttgtcgaag aagctccaaa gccaaaggct ctttcgccgc 2340 
ctgcagcccc cgctgtacaa gaagaccttt gggatttcaa gaaatacatt ggtttcgagg 2400 
agcccgtgga ggccaaggat gatggccggg ctgtcgcaga tgatgcgggc tcctttgaac 24 60 
accaccagaa tcacgactcc ggacctttgg caggggagaa tgtcatgaac gtggtcgtcg 2520 
tggctgctga gtgttctccc tggtgcaaaa caggcatgga cattacctct tcagtctctc 2580 
ttcctgttgt tcataaaact ttgctcgaat tactcataag aacaaacatt gtgttgcata 2640 
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ggtggtctgg gagatgttgc gggtgctctg 
gttatggtac tacaagcttt catttaactc 
tgagtagtat aatgttatta agtgcaagac 
gtaaattaat ataagagcgt ttagattact 
tttacagacg gagtagagta tttcatagcc 
tgggtggggg agggggtttg aaacaagtgg 
ggctgataac cacaccatca gtgaaggaat 
ccaacgtcgg gtttacccgc cctatagatc 
accaaatatc gccagcgccc gtgtgtgtat 
tttccggtta atggtttcta tcatattcac 
aatgtagatg atggataaat gtatgttgtc 
gagctagttt cgcggttcgg ttagagccat 
gtgagagagg gttttgggga gttaactttc 
ccagtaaaga gtaaactatt ttctgcaggc 
gaaaatagtt atggtatcat ataaaccata 
gctagacttt gataatctga aattttaaat 
tcttaggttg tggtaccaag gtatggggac 
aaatactaca aggctgctgg acaggtaagc 
tattgcttat tgtcataata aatcaatttt 
aagtgaatta tttccatgct tatatcgatg 
tcttccgaca ccgtcaggaa gacatttatg 
ttggtgtttg attgcactga taaactgaga 
ttacacattt tattttttca ggaaattatg 
gttgaggtat ctctccaact caattgacaa 
atgtatttca acagatacat aatctcttgt 
accttacatg cacatttggt caagcgttat 
tgaacaatta tcttgatgat ccttgttact 
gcgaattgat ttggaaatag catttccacc 
tcatccaatt tagatatttt cgtacttggc 
catttttatt cctctataat ttgcaggttc 
atggggatgg aaatctggtg tttattgcaa 
atctgaaagc atattacagg gaccatggtt 
tacataacat cgctcaccag gttccttttc 



-47- 

cccaaggctt tggcaaagag aggacatcgt 2700 
tgttgggtcc atatgttcga ataatatcag 27 60 
atgaaagtgt tcttctgtca tactccctcc 2820 
actttagtga tctaaacgct cttatagtag 2880 
aaccctggag gttaggttgc tgaggcctac 2940 
tggttagcag ccagatttca caaagaagga 3000 
gaatgtcggg tacccgatcg accgttttgc 3060 
cgaataagta gttcctatct tcaattaggt 3120 
ttatactact ggatgatcaa tttatcaaca 3180 
tgtaattgtt agtaaacagt agatgtttgt 3240 
gagctttcat ttcaatgcaa ttttgattgg 3300 
caaaacccca gaatttttgg gagttggctt 3360 
gggattcagt tagagacgct cttactagtt 3420 
atcccaatta ttctgtagaa attagaagtg 3480 
tattattcaa aatctagaat catggacttg 3540 
ttgatgataa ttgagaaatg atcctttcta 3600 
tatgaagaag cctacgatgt cggagtccga 3660 
aaaaatgcaa tcgaagggga gctgaaattt 3720 
taagtgtttt ttttgtcctg caggatatgg 3780 
gagttgattt tgtgttcatt gacgctcctc 3840 
ggggcagcag acaggttaat cttctatatg 3900 
acaagccaag gcctactgac tggcatatga 3960 
aagcgcatga ttttgttctg caaggccgct 4020 
cctattacca ctatacaatt atgtgtatgc 4080 
gaagtgcata tatactaata acatttcaat 4140 
gatttaactt ctgataatct attgcactga 4200 
tcatcgttat gtttccatgt tctcttcacc 4260 
tgccacaaac aataatatac actcctactt 4320 
atatcatccc attaaatatt attggtccat 4380 
catggcacgt tccatgcggc ggtgtccctt 44 40 
atgattggca cacggcactc ctgcctgtct 4500 
tgatgcagta cactcggtcc attatggtga 4560 
tcctaatctt gatttttctc tagtctctac 4 620 
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tatttactcc acattgtttg aggaaactaa 
agttatagtc ttatagaggt aaatgcacca 
tttggtgctt acagttgtag actatgaaaa 
tacggtgcat tttccgtatg taggagtcaa 
ctatagctgt tagaccgtgc ctacgtcgcc 
gggccccact tgtcaaccta tgacataaat 
tggggtcttg aaaatgggac ctcgcaggta 
tcccctatgc acttcatgtc ttgtgtatgt 
atgctgtttt tctttggttc aaggctacca 
ggccagcgcc ttcatgatgg cccaagtgct 
tgtcatgacc atccaccaac ccaacacaca 
gcccctcttg tcctttcccc tcgtacccaa 
agttgtgacc atcgcctgcg tcgcctcata 
cctacttggg agcccatacc tccctgcaca 
atcgcaaacc aacttctcct ctccttctcc 
gcaatacatg ccgagttggc catggcccta 
ctctaagcct agcacctttt cccctcacca 
cctacgtcgg ctgcagttgc ctgccgcctc 
tcggcgacat ctcctcgacc tcccattcca 
catccatgtg aaccgaatca tcatagaact 
cactgttcct ctattccccc caagccgtgt 
atcccttggg tcatcggttc aatggctatt 
cacacgccac actaagccct ttctttctcc 
tacttagcca gagagagaac atgagcttgt 
tcttaacggjc tacaaacaac ggatatggtg 
tcatactgca tgcgagagcc agagccaggt 
gcgagcatca aagtgtacat atgccgaacc 
ggtgcggtgg gtggctcaaa gacaccccaa 
cggtgccgaa ccatattgaa gtggtgaggt 
ggatgaggga cataaaggat ctcataaata 
gcgaagcgct tcatgatttc catctcccct 
atttctcagg tcgcttctcg tctaaatccg 
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acgggttgca aaattatgat ggcttatgaa 4680 
gtggtgcttg aacttgtcac gcgtgttcac 4740 
acgggtgcaa aaacttgctg ttgtgtgcca 4800 
acgttgccta tgtgggcatt gtattcccgt 4860 
attgggccca cacactctct atttacatgt 4920 
aaatggaaat ttataataaa aatgatggcc 4980 
tgctggtagc cagcacgccc taaacattaa 5040 
gtgtgtctgt gtggggaggg gggggtatgc 5100 
tgctcaacaa gcccacctcc gcttcaacac 5160 
ccgcaccatc gctcaaagcg gcaacgtcgt 5220 
aaatcctcaa catccgcaaa tagtgagcat 5280 
acatgtcttg ataacccttg gagctgcaca 5340 
gagcccgacc tagccggacc gttatagaag 5400 
tcctcctctt tccccataga tcgtgccgcc 54 60 
cactctggcc gtttcccccg ccgcgaagct 5520 
ttccccaatt gctcgcacta ggaggtcctc 5580 
attgcaagtt ggggagcccc tcgcgagctc 5640 
aactctgatc cagacctcgt tcccgtggcc 5700 
cacgtggcct ggcgaggatc accgcatgtt 57 60 
aacaccggag aggtcatccc gacggcgtcg 5820 
cgcgtcataa tataagacgg acttatttgt 5880 
tctttctcct gtctactgat aagtgggacc 5940 
tacccgttga taagtgggac ccacacacag 6000 
tggtgccacg tcggcaagcc atgtcagcag 6060 
tcacgtgagc gtttacgaat ggaaagtgca 6120 
ttttgcacca gttttctgta ttttacaact 6180 
aaagtgaaca tggtgagtcc attcttttct 6240 
tagaagctat tgcctccgac attgccaatt 6300 
cagttgcttg tgctatgact actaggtatt 6360 
ttgcaatgtt cattcaaatt cttaacattt 6420 
agatcagaga cacttggtcg tgtacactga 6480 
catatgtagc tcacttcaat gacttgcctt 6540 
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tggtccagct 


aacgccattt 


gcgtagcaaa 


atttggatca 


cgggcagacg 


cgctagacaa 


tgatccgctc 


ttcccgaaga 


cacttgtgat 


tgtctctcca 


tggtcgcccc 


agccatagat 


aggaacaggg 


tgccaccttc 


ggacaagaag 


ttgatttgga 


aggaacacaa 


caacagtctt 


tattagacgg 


atcaagtgtg 


atgaatccta 


gtcactattt 


ggctaggtcg 


cttgccatcc 


tgctcaattt 


gtattttgtt 


gttatgtgtt 


aagtagaaaa 


aaattctcct 


ccataatgat 


aaaaatgaga 


acatccgtgg 


caagtttaag 


tatacaacac 


tgacatgccg 


aattacatgc 


cgttgggcta 


attctttctc 


ttcatgttgc 


ccgttcaccg 


agttgcctga 


gcactacctg 


ggtgaacacg 


ccaactactt 


cgccgccggc 


agccccgggt 


acctgtggga 


gctgaagacg 


atacggcaga 


acgactggaa 


gacccgcggc 


aaccccgagg 


tggacgccca 


cctcaagtcg 


ctggactccg 


gcaagcggca 


gtgcaaggag 


cgcgccgacg 


tgccgctgct 


cggcttcatc 


atcatcgcgg 


acgccatgcc 


ctggatcgtg 


accgggcgcc 


acgacctgga 


gagcatgctg 


gtgcgcgggt 


gggtggggtt 


ctccgtgcgc 


gcgctcctca 


tgccctcccg 


gttcgagccg 


tacggcaccg 


tccccgtcgt 


gcacgccgtt 


gaccccttca 


accactccgg 


gctcgggtgg 


atcgaggcgc 


tcgggcactg 


cctccgcacc 


ctccaggagc 


gcggcatgtc 


gcaggacttc 


gacgtcctcg 


tcaaggccaa 


gtaccagtgg 


gcatgcgtgc 


atgacaggat 


ggaactgcat 


ggcatccgcg 


aagtacagtg 


acatgaggtg 


cccgtagcag 


agtagagcgg 


aggtatatgg 


gttgtgtgca 


ttattacaat 


gttgttactt 
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tttttcatat 


ggctcgctct 


gcgcaagagg 


6600 


ggtcttccgc 


acaatgaaca 


ttgagttttt 


6660 


cttattacga 


gttgtgccat 


ttcaaacatc 


6720 


gccttgttct 


ctgaatggtg 


ggtttcagct 


6780 


ttgcgtagtt 


tggtcgtctt 


aactgcttgg 


6840 


tgaaggcaaa 


gctaattcct 


tcgatcaagt 


6900 


ctggtacaat 


gccgttgcta 


gttgcttgga 


6960 


cgctctgtgc 


taagcgcttg 


gggtcgcttt 


7020 


tttagtaatg 


taacctgaac 


tttctggact 


7080 


cacatacagt 


tctcctgcat 


ggttcgaaaa 


7140 


caccaccggt 


gcatttttac 


ctcaaagtta 


7200 


tttggtcagt 


tattccattc 


ttcggtactc 


7260 


atgcagggcc 


gtggccctgt 


agatgaattc 


7320 


gaacacttca 


gactgtacga 


ccccgtgggt 


7380 


ctgaagatgg 


cggaccaggt 


tgtcgtggtg 


7440 


gtggagggcg 


gctgggggct 


tcacgacatc 


7500 


atcgtcaacg 


gcatcgacaa 


catggagtgg 


7560 


gacggctaca 


ccaacttctc 


cctgaggacg 


7620 


gccctgcagc 


gcgagctggg 


cctgcaggtc 


7680 


ggccgcctgg 


acgggcagaa 


gggcgtggag 


7740 


agccaggacg 


tgcagctggt 


gatgctgggc 


7800 


cggcacttcg 


agcgggagca 


ccacgacaag 


7860 


ctggcgcacc 


ggatcacggc 


gggggcggac 


7920 


tgcgggctga 


accagctcta 


cgccatggcc 


7980 


ggcggcctca 


gggacaccgt 


gccgccgttc 


8040 


acgttcgacc 


gcgccgaggc 


gcacaagctg 


8100 


taccgagact 


tcaaggagag 


ctggagggcc 


8160 


agctgggagc 


acgccgccaa 


gctctacgag 


8220 


tgaacgctag 


ctgctagccg 


ctccagcccc 


8280 


tgcgcacgca 


ggaaagtgcc 


atggagcgcc 


8340 


tgtgtggttg 


agacgctgat 


tccaatccgg 


8400 


gaatcttaac 


ttggtattgt 


aatttgttat 


8460 


attcttgtta 


agtcggaggc 


caagggcgaa 


8520 
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agctagctca catgtctgat ggatgcacgt 

cggcaagaat gggaagtgaa ttcctccctg 

cagttaaaac aatagcactt cgagtggaag 

tggactcata gcatgttacc aaaaaatgcc 

ccatcaacat ttgaacctat acaaactaga 

ccagatacat aggtgccaaa gggctacaac 

caagtgaagg caacaagcat cactacggag 

gaagagttga agttgtgatt tgacgaaacc 

acacgtcacc gtccaatcca aaga 

<210> 38 
<211> 11611 
<212> DNA 

<213> Triticum aestivum 
<400> 38 

taatccgttt gtctaatgaa atatatgtga 
cacccccccc ctatatgagt attaaattca 
acaattttgg gatactaaac ctggatgctc 
aaatatcagg aaacgtattc tcagtaaaaa 
ctgtttgggt ggatcatagg tcagactata 
ttttcacaaa acttcacatg agagtagatt 
caagttcttt cgaattttcc tagtattttt 
tccaggagct ctggtgtatt tttcaatata 
acaatctcag aaaaaactgg acggtttatt 
tcagaagtgg ccctcagccc ctcactcttc 
tcctgcacga acattcgcgt tgaagttttt 
agaaagcaag tacaaaaaac accagccatc 
ttccatgtgt gcgcacacgg agaagcagct 
tcgaagctgc tctcggacaa aatggttgaa 
tccacgccag agcgttgtat tccaatttta 
cgggcgaggc agaggggata gggcagtcgc 
gtggtgggtt ttgcgggcgg ggtttagtag 
acggagccct ctgtgccctc ggagcagtca 
cggcggcggc ctcgcgcagg tacgggtgat 
tgttgtttga tttggttctg tcccgggtca 
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gccatggttg gtttggtagc gcagtgcaaa 8580 

cttgaattag cactttcagt aataatcagt 8640 

tgaacaagaa aaccaacatc acacccggta 8700 

tttcgccccg ctgtatatat aaagcaacga 8760 

acacaccact caaaacccac acactcaggg 8820 

cacaacacac cgaaagactc acatagacta 8880 

cctccggcgt ccttccgatg aagaaatcat 8940 

gtgcgctcca aaacggtgcc ttcaggaagg 9000 

9024 



tgggagagga tttggagcat tggggtgctc 60 
aaaaacaaac cgaggatatt caaaaagtct 120 
agtctactcc catgtgaagt ttcatgaaaa 180 
cagacaaaaa attcttatgc acagaaaaaa 240 
ttttcttcca tggatacatg tcatggtatt 300 
tgggcatcca agtttgatat ccccatattc 360 
tgaaattaat attcgtatag gggtggagca 420 
tgtatggtta tttaaaaaaa aactcgtaca 480 
ctagctgatt ttgtgtgcag tttcccataa 540 
ttcctcctac cttctgctct gtcttccgct 600 
tcaaaagaaa acaatatact tgctggaaaa 660 
caccaccgtc cgttactggt ccacctgcat 720 
cgaacaaaaa aaccaaacga aaataaagga 780 
ggacgaagga gcctttttgg tgcgcagatc 840 
gttctttccc cgtgaggagg ggaggctagg 900 
cgctgcgtgg tggactgact ggtgtggtgg 960 
gttcccggaa atggagatgg ctctccggcc 1020 
gccgctcgtc gtcgtccggc cggccggccg 1080 
tatggttctt gattcggtcg gttcacggaa 1140 
ggttcatagt gattttattc cgcaaaaaaa 1200 
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aaaggtttat 


agtgattttg 


atttctttca 


aagggcattg 


gttttgattt 


geatgeggaa 


ggaattcata 


ctgcttaaaa 


egaegtgatt 


ttctgttata 


tttgttaaaa 


aaaatcccct 


aaatgttaat 


gttaatgctg 


gttaatttgg 


caagaaacag 


aaattcattg 


cgaaaaaatg 


tctgcacctt 


gtatgtttgt 


gatgaagtta 


tccgtatgta 


aggegagcat 


tgccatcttt 


gagtgtatca 


gtagttcgaa 


ttgcgctaat 


atataaattg 


gattacatcc 


tttctgttga 


ggtcagaaaa 


tgagatacag 


tggggacctt 


gaggtcagaa 


ggegatttea 


gtagaattta 


aaatgacctt 


gataattctg 


ttccgattgt 


tatctataag 


aaggtttcct 


ttttacgetc 


cttttgtgtt 


ttccagcctt 


ttttgatgaa 


atgcatggta 


gcaagttcag 


gtttgaggaa 


gtttggaagc 


aatgtcttat 


tcaaacctca 


tgacaatgaa 


tactgtagtt 


tatgaaacca 


gtatgtttgg 


caatcaaaag 


tatacagegt 


gttgcgtttc 


tcagtttttt 


aaaaagaggt 


actacagttt 


tgtggatact 


gtagtttata 


tcaaagtatt 


taaaaccata 


gtttttagaa 


atactttgca 


acgaaacaca 


geccagatgt 


tcattcagat 


cctcctaata 


ggaaatcaag 


ttcttctaga 


ggatatacga 


caagactcat 


C a A 1" » A t" C^CICI 


MO L\J UUvj CI Q-i 


ctcttgatac 


agaatggaca 


gatactagag 


aagecgagae 


aagcagttct 


ataategggg 


gagtggatgt 


gacagtgaat 


tcattaagca 


gtataacgaa 


agttaaagaa 


gaegtatttg 


agctggattt 


ggatgtgatg 


gatcataatg 


ggactgtaca 


gatggatgat 


geggeggaca 


aagctagagt 



-51 - 



tetegggaac 


atttttatat 


ctgggagtca 


1260 


catattggtt 


atttattaat 


gtggtgagct 


1320 


ttaattgctg 


gaagaggtaa 


agaacatgaa 


1380 


gttctagcgt 


ttcagtctgc atgatcatgg 


1440 


agtgaagatt 


tccacggcaa gagtttcgaa 


1500 


gtggagcgaa 


ttcggagagt atttacattg 


1560 


tttccatata 


ttttttgega 


taaagttact 


1620 


ctataagctg 


gtatttgtct gecagatage 


1680 


gttttttgac 


gaaacgaaac 


tatgaagacg 


1740 


aeggagaaat 


ttatccttgc 


ttagaagtga 


1800 


ccctactgta 


ttatgctaaa 


aagaagaagt 


1860 


tatgagaggc 


ataaataatt 


tggtaggatt 


1920 


tegcaaatae 


etteggattt 


tctcaagcat 


1980 


aaacatgttg 


agetgeacaa 


cttattttcc 


2040 


tggcagattt 


actcgaagca 


ggacccttcg 


2100 


taatctgtca 


aatggcctat 


cattctatct 


2160 


gtattttgat 


actacggttt 


tetatagega 


2220 


acagtctttc 


taagtatttc ggcaacagtg 


2280 


tgeaatagge 


caccagtaga caaggecttt 


2340 


cccaactact 


ttttttaata 


ctgeaaaaac 


2400 


atactacaat 


ttttattaca 


gccaaacacc 


2460 


aaactgtagt 


atccttgaaa 


tactttgaga 


2520 


tctgttaact 


tcatgtcttt 


ecaaattgea 


2580 


aaagatggta 


tcacctcagg 


ttaaagtcat 


2640 


tgttgaacca 


agcaccgaga 


atatagaaca 


2700 


atacaatgeg 


ctattaagta 


ccgagacagc 


2760 


tgctaaagcg 


gactcgtcgc 


aaaatgcttt 


2820 


ggcggatgaa 


gatatacttg 


eggctgatet 


2880 


gaaggaagtg 


gatgcagtgg 


acaaagctag 


2940 


gccagcaact 


acattgagaa 


gtgtgatagt 


3000 


agagacattg 


agaagtgtga 


tagtagatgt 


3060 


tgaagaagac 


gtatttgagc 


tggatttgtc 


3120 
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aggaaatatt 


tcaagcagtg 


cgacgaccgt 


tgttcaagac 


acatttgagg 


cgaactcgtc 


ggaagtggat 


acgagtgctg 


aagctgggaa 


aggaaatgtt 


ttttcaagca 


gtacaacagt 


tataaaagac 


aggtttgaga 


cggattcgtc 


ggatgcaatt 


gatgaaaccg 


tggctgatca 


tgcttcaagc 


tgcgcgacat 


acagagaagt 


agaggaaaca 


tttgcgatgg 


atttgtttgc 


agtggatcat 


gtgggtgaag 


ctaccgatga 


accgtcttca 


ttctctatgt 


gggacaaggc 


tgagctgcga 


cttgtcaggg 


ttgaagaaca 


cctgtcaatt 


gatgatttac 


caggacaaaa 


taaatcaatt 


gctgatgttg 


cgggaccgac 


ccggtcaatt 


gttgctttcc 


ccaaacaaaa 


gcagtccata 


gttggattcc 


gtagtcaaga 


cgtaccaatt 


gttggtacgt 


cgagagaggg 


acaggatgcg 


ttgtatgtga 


atggactgga 


aaccgatgag 


gatgtgcttc 


atgtaaaatt 


ggcagataga 


acccaagcag 


tggaaacgat 


ttacatgact 


gaacatcaga 


taggtgctgc 


gctttctata 


actgaaattg 


gaatggggag 


ggaagagctt 


tcatggtctg 


aagatgaagt 


agttgatgag 


acctctgtgt 


ccgttaacgt 


tgttgtggat 


ccgcaagcac 


taaaggtgat 


gatgaggaac 


aagctgtttg 


tttttccaga 


ttatttcaat 


cgtgacctaa 


cagctttggc 


attcaatggt 


tggaaatgga 


ggcttttcac 


ggtttggtgg 


tcttgcaaac 


tgtacatacc 


cttcaacggt 


cgcacggtct 


atgagaacaa 


aggcactatg 


aatgaagatc 


tgtttgagga 


tgagaaactt 


gccatggaag 


aagctgaaag 


taaggaagca 


agggctgcag 


atgaagctgt 


caagaacaaa 


aaattgcaga 


gtatgttgag 
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ggaactagat 


gcggttgacg aagtcgggcc 


3180 


aggaaatgtt 


tcaaacagtg caacggtacg 


3240 


tgatcaaggc 


atatttagag cagatttgtc 


3300 


ggaagtgggt 


gcagtggatg aagctgggtc 


3360 


aggaaatgtt 


tcaacaagtg cgacgatgtg 


3420 


agacgcagtt 


gaggcggatt tgtcgggaaa 


3480 


ggatgatgtg 


gtggatgaaa ctagatcaga 


3540 


aagtgaatca 


ggccatgaga aacatatggc 


3600 


agaagagact 


taccaacagc aatatccagt 


3660 


tattgctaaa 


acaggtgtaa gtttgaatcc 


3720 


aggcaaagta 


aattttagtg ataaaaaaga 


3780 


ccaatcgatc 


attggttcct ataaacaaga 


3840 


ccaatcaatt 


tttggttcta gtaaacaaca 


3900 


ccagtcaatt 


gttagtgtca ctgagcaaaa 


3960 


tctttcggct 


gttagtctcc ctaaacaaaa 


4020 


tcaaacaaag 


caagttcctg ttgttgatag 


4080 


agctaaggag 


ggagatcaca catccgagaa 


4140 


taatgttgac 


aatgtgttgc ggaagcatca 


4200 


aacttggaag 


aaagttgatg aggaacatct 


4260 


cgaaggacag 


atggtagtta acgaggatga 


4320 


aggtgataaa 


attcagcatg tgctttctga 


4380 


gcagttaatt 


gaggatgatg gacaatatga 


4440 


tgaacaagat 


atccaggiggt caccacagga 


4500 


gctgcaagaa 


ctcgctgaga aaaattattc 


4560 


ggtagtgaaa 


gctgattcag ttattgatct 


4620 


gaatgaaccc 


gatgttgtca tcaaaggagc 


4680 


tgaaagattg 


cataagagtg accttggagg 


4740 


caaggaggcc 


tacagattag actttgtgtt 


4800 


tggcaacaat 


gatttctgta taggaataga 


4860 


tttcttggtt 


aaagaaaagc aaagggagct 


4920 


gaggacacag 


actgaagaac agcggcgaag 


4980 


cagggcacaa 


gcgaaggccg agatagagat 


5040 


tttggccaga 


acatgtgttg ataatttgtg 


5100 
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gtacatagag gctagcacag atacaagcgg 
ctcgaggcca cttgcgcata gtactgagat 
agatggactc tctattgttg aaagctttgt 
gtatgcagat ggtacgacac ctcaaccttt 
ttgttgagga aacatttgtt ttgattctag 
tccttgtttt attgatgtca tgagaaagta 
acatttacca tagacagacg cttaaagatc 
taatacctgt cttttctcta tatgtacagt 
ctgggttttt gctgatgggc cagctgggaa 
agatttccat gctattcttc caaacaacaa 
ggagcaaaac atctatacaa ggcttctgca 
aagaaaggtg agttgcaaca aaatctttgc 
cctgagtgat ggcaaatata ttccctttcg 
catgcaagct tatccaaaat cacttgataa 
ataagaacat tcctacttaa aatttgcaag 
tgagtaactg gcaattaaca aagaaaagat 
cagggttgtt tgggttgact catgttcctt 
tatcaaagct gagatgaagg caaaaactat 
tgtttatacc gaaccgcttg aaatacgtgc 
ctctaacaca gtgctaaatg gaaagccgga 
gatgcatcca agtggagcat tgccacccca 
cttaaaagcc acaggtttat tgcgttatta 
ttttatgcaa tcaatagagt caagtgcaac 
ttgttctatt attggtaata attagctagt 
actactccct ccaatccata ttacttgtcg 
tactaaagct gtgacaagta atatggaccg 
ttgagaccga gtgtctgctc gggtggctag 
tcttctctaa aaaaaagtgc ttgcagcccc 
tttcttctta aaaattatgg caccaaggga 
ttttcggagc gtgaatggga gcgtctttct 
tattgtgaag tcacttaagc cttgttaaaa 
atccctgtta aattggttta ctgtgtacta 
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agatactatc aggttatact ataacagaaa 5160 
ttggatgcat ggtggttaca acaattggtc 5220 
caagtgcaat gacagagacg gcgattggtg 5280 
gtacataagg caacattgtt ttgatttttt 5340 
cataatgctc ctacaaatat ggcatgaatt 5400 
ttttattaac tcgaaggcca tggaagctca 54 60 
atttgtattc cgtggatcat atatgtaatg 5520 
tattccacct gaaaaagcac ttgtgttgga 5580 
tgcaaggaac tatgacaaca atgctcgaca 5640 
tgtaaccgag gaaggcttct gggtgcaaga 5700 
agaaaggaga gaaaaggaag aaaccatgaa 5760 
atatgatctc tataattttg gcagttaacc 5820 
tctattttcc aaattcaaaa tgcatggttc 5880 
tataccaatc acaacataac tttgtttacc 5940 
gtaactccct ttcgaggctg gttggcttga 6000 
atatctgatg tttggaacaa aacatatgat 6060 
tttacctaca caggctgaga gaagtgcaaa 6120 
gcgaaggttt ctgctttccc agaaacacat 6180 
cggaaccaca gtggatgtgc tatacaatcc 6240 
ggtttggttt agatgctctt ttaacctttg 6300 
gaagatggtg aaatcagggg atgggccgct 6360 
catcactgtt attagtatat atataaccat 6420 
taatgatgca cagataggat ccaatatttc 6480 
ttaatgccat aagc.;cataa cagatatgca 65^0 
caactttggt acaactttag tacaaagtta 6600 
gagggagtac tatataagct tgtagctgtt 6660 
ctggagcggg ctgaagtgct tgcaggcacc 6720 
cccgccccct ccatagggtg agtggtcacc 6780 
aattctcggc tggtcgagct tgtagctatt 6840 
gtataaggcc tataggctta ctttgatata 6900 
cgtagaaact tagttccgca acttggccaa 6960 
gatgcatcga tggcgcagag tccggggggt 7020 
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aataaagctt ccattttcta caatgaagtt aattatccta cttgccttgt aattactgag 7080 
tacaatacag agcaccgaaa agctgtatcc ttcctacttc cttatgttta tctgtgttcc 7140 
ttgtctagtt aatgttccac cggatgccta tatgatggac tttgttttct ccgagtggga 7200 
agaagatggg atctatgaca acaggaatgg gatggactat catattcctg tttctgattc 7260 
aattgaaaca gagaattaca tgcgtattat ccacattgcc gttgagatgg cccccgttgc 7320 
aaaggtaata taattctaag gctagtttct ttgatgcgag gcgagatctc atcaccttat 7380 
gccttttttt cattctatgc cataatacta tgctctgtca tgatcgatga tctcataggt 7440 
tggaggtctc ggggatgttg ttacaagtct ttcacgtgcc gttcaagatc ta.gggcatac 7500 
tgtcgaggtt attctcccga agtacgactg tttgaaccaa agcagtgtaa gttgaagtac 7560 
tgtactacat aatctattca cttagtcttt aaaatttcaa ctcaaaatgc cacgaagctt 7620 
caactgaagc taaagaattc tgagctgcga tggagcgcag tagggtggca cagatcccaa 7680 
taaaccaata tatgaccaat aagggggtgc caagatcagt aggcactaat gaatttcctt 7740 
tgttttatat ccattataca ttattaatca agttacatct atttcaatgc aggtcaagga 7800 
tttacattta tatcaaagtt tttcttgggg tggtacagaa ataaaagtat gggttggacg 7860 
agtcgaagac ctgaccgttt acttcctgga acctcaaaat gggtatgaat cagctaatgt 7920 
atagtttttt ttgtgggaaa tgtatagttg agtgatataa aacatattac ttcttttcac 7980 
aaaattatta ggctagagcc ttgtactggt taataatgtg tacctttttc tcattcatat 8040 
aactacttat cgtagactat agaagccaat tagtaacaca atacattggc cttggcattc 8100 
caggctgaga gctagttata acaatgatat gtgagattag tggctctata accacttttg 8160 
agctaaagga atttgctgct agatgagcca atcaatccaa ctaattttaa attccatgat 8220 
caccctagga cacgcagcct gcacaaccaa gaacacagct aagatcatcg cgtgggcaca 8280 
aaaggttgtg cattaaggct aggccctggt cagtggctgt caaggactcc atggggctcc 8340 
ttacagtttt tattctgata tctcttgcgc ccatatgacg ctaccaaacg cttgtaacct 8400 
gtagcaaact attgccatct gtcactcaat gataaggtag acaatctttc ctttcccttt 8460 
aagatgttca acctttattt atgcttgagg atgcgtttga ttgtcaaatt tcagtttctc 8520 
tagattgcag acacacttgc acgtgctgtg tacaccttcc attatctggc atgggatttg 8580 
catttcaatt aagagaaata tgaaagaaag aaatgttatc acctgaatgt tagagcttaa 8640 
aaggcacaag caatcagcac catttatcaa aaataaatga tttacttgtc tagttgtctc 8700 
tttttggttc tcttcctgta agtggatgcc aatatctcaa gaactctcct gaggattttt 8760 
cttcacaacc tattcatttt gacatttcct tttctaggat gtttggcgtc ggatgtgtat 8820 
atggaaggaa tgatgaccgc agatttgggt tcttctgtca ttctgctctt gagtttatcc 8880 
tccagaatga attttctcca gtacgtatta tttagaatac tagctgctat attgactttt 8940 
tctttgtgag actacacttt cttgtttacc attccagtgc accatgttca aaatcttgta 9000 
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ttcagcgcgt tactttcagt ttctttacta ctagcttatt tggtgcattg gtgtttcctt 9060 
tcctactcta ctatctgaat gctacttgtg ttttcgcaac agttgcttct ttatcccctt 9120 
ccatttctca gttaaaaaaa cttgcatctg tattcacgtg acagcatata atacattgcc 9180 
atgattggtc aagtgctccg gtcgcctggc tatataagga acactattcc caatccagaa 9240 
tggcaagcac tcgggttgta tttaccatcc acaatcttga atttggagca cattatattg 9300 
gtaaagcaat gacatactgt gataaagcca caactgtgag tgccttactg tcttgtaatt 9360 
tttaatcttt ctgtttggcg cacagaaaat cttccacatt ttacagaatc atgttcttgt 9420 
gttttgtacg tattcaacta tttccaccca aacttttcag gtttctccta catattcaag 9480 
ggacgtggca ggccatggtg ccattgctcc tcatcgtgag aaattctacg gcattctcaa 9540 
tggaattgat ccagatatct gggatccata cactgacaat tttatcccgg taccagattt 9600 
tttcccagag tgcaagtaga tatataccaa ggccacagat agttttatgc ttaactatgt 9660 
gtttcatact acttcaggtc ccttatactt gtgagaatgt tgtcgaaggc aagagagctg 9720 
caaaaagggc cttgcagcag aagtttggat tacagcaaac tgatgtccct attgtcggaa 9780 
tcatcacccg tctgacagcc cagaagggaa tccacctcat caagcacgca attcaccgaa 98.40 
ccctcgaaag caacggacag gttcatcatc ccttgtgaac gaataaacat caaacgtttt 9900 
gtttataaaa agttgcttac tatttgtttt tgtttacttc aaaacaaaag tctgaaaatg 9960 
aagtgtttgg ttcctaggtg gttttgcttg gttcagctcc agatcatcga atacaaggcg 10020 
atttttgcag attggccgat gctcttcacg gtgtttacca cggtagggtg aagcttgttc 10080 
taacctacga tgagcctctt tctcacctgg tgagctccaa tatcctacac accatctagc 10140 
cagcccttca ttatgggagc tggagactac tttataattt aggttgatga tcgatcatgc 10200 
tgcagatata cgctggctcc gacttcatta ttgtcccttc aatcttcgaa ccctgtggct 10260 
taacacaact tgttgccatg cgttatggat cgatccctat agttcggaaa accggaggtg 10320 
tgtgactatt tctctccatt atgctgcact gatttgcata tgtcgagctg ttggacatga 10380 
aatggaaact atcctttggt atcgcaggac tttacgacac tgtcttcgac gtagacaatg 10440 
ataaggaccg ggctcggtct cttggtcttg aaccaaatgg gttcagtttc gacggagccg 10500 
acagcaacgg cgtggattat gccctcaaca gagcaagtat cgttcctcaa ttagccctga 10560 
attcagcagt agtgctaggt tatttacctt gcatgttcca tacctcattt cagagcaatc 10620 
ggcgcttggt tcgatgcccg tgattggttc cactccctgt gtaagagggt catggaacaa 10680 
gactggtcat ggaaccggcc cgcactggac tacattgaat tgtaccatgc cgctcgaaaa 10740 
ttctgacacc caactgaacc aatggcaaga acaagcgcat tgtgggatcg actacagtca 10800 
tacagggctg tgcagatcgt cttgcttcag ttagttccaa gcgcactgca gtcgtacata 10860 
gctgaggatc ctcttgcctc ctccaccagg gggaacaaag cagaaatgca tgagtgcatt 10920 
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gggaagactt ttatgtatat tgttaagatt ttccttttct tttccttccc tgcacctgga 10980 

aatggttaag cgcatcggca atataagaac cgcagtgaca ttttgtgagt agctttgtat 11040 

attctctcat cttgtgcaaa cttatgtgca tgctaggctc tctgatcatg tggaagcttt 11100 

gttatatgtt acttatggta tacatcaatg atatttacat ttgtggatga gctactgcac 11160 

ttggtttctg ctatctgttt tgtgaaatgg cagggccatg attatgcaga ttcactggtt 11220 

ctgaaacaga cacgctcctc taagctgtga ctgtgagctc tgaaaacagc attgttaaca- 11280 

tctattagta taaactaagg tacatcaacg gtgaagattt acgagctaaa ctccgtttgg 11340 

ttgtagacat tcactagaag tataagcgcg cttttctgcg ccgcctaggc tgcaatgatt 11400 

ttttttttat gtgtgtgtgg atatttcact atgacctgtg ggcaaaaggc tggccgagat 11460 

ttaggaagcg ctcaagcaat tggccaatgg gaaggtgccg gccctgatgg tttcacggcc 11520 

cagttcttgc gctcctgctg ggatatcatc aagggagatc gagaattccc gggatccgcg 11580 

gccgcgagct tccctatagt gagtcgtatt a 11611 

<210> 39 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 39 

Lys Val Gly Gly Leu Gly Asp Val Val Thr Ser 
1 5 10 



<210> 40 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 40 

Gly His Thr Val Glu Val lie Leu Pro Lys Tyr 
1 5 10 



<210> 41 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 41 

His Asp Trp Ser Ser Ala Pro Val Ala Trp Leu Tyr Lys Glu His Tyr 
15 10 15 



<210> 42 
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<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 42 

Gly He Leu Asn Gly He Asp Pro Asp He Trp Asp Pro Tyr Thr Asp 
1 5 10 .15 



<210> 43 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 43 

Asp Val Pro He Val Gly He He Thr Arg Leu Thr Ala Gin Lys Gly 
1 5 10 15 



<210> 44 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220>. 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 44 

Asn Gly Gin Val Val Leu Leu Gly Ser Ala 
15 10 



<210> 45 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 45 

Ala Gly Ser Asp Phe He He Val Pro Ser He Phe Glu Pro Cys Gly 
1 5 . 10 15 

Leu Thr Gin Leu Val Ala Met Arg Tyr Gly Ser 
20 25 



<210> 46 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 



<400> 46 

Thr Gly Gly Leu Val Asp Thr Val 
1 5 
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<210> 47 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 47 

Lys Thr Gly Gly Leu Gly Asp Val Ala Gly Ala 
1 5 10 



<210> 48 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 48 

Gly His Arg Val Met Val Val Val Pro Arg Tyr 
1 5 10 



<210> 49 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PEPTIDE 
<400> 49 

Asn Asp Trp His Thr Ala Leu Leu Pro Val Tyr Leu Lys Ala Tyr Tyr 
1 5 10 15 



<210> 50 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

.223> Description of Artificial Sequence: PEPTIDE 
<400> 50 

Gly lie Val Asn Gly lie Asp Asn Met Glu Trp Asn Pro Glu Val Asp 
15 10 15 



<210> 51 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PEPTIDE 
<400> 51 

Asp Val Pro Leu Leu Gly Phe lie Gly Arg Leu Asp Gly Gin Lys Gly 
1 5 10 15 
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<210> 52 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 52 

Asp Val Gin Leu Val Met Leu Gly 
1 5 



<210> 53 
<211> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 53 

Ala Gly Ala Asp Ala Leu Leu Met 
1 5 

Leu Asn Gin Leu Tyr Ala Met Ala 
20 



<210> 54 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



Sequence : PEPTIDE 

Thr Gly 
10 

Sequence: PEPTIDE 

Pro Ser Arg Phe Xaa Pro Cys Gly 
10 15 

Tyr Gly Thr 
25 

Sequence : PEPTIDE 



<400> 54 

Val Gly Gly Xaa Arg Asp Thr Val 
1 5 
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